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The  Marvels  of  Electromagnetic  Band  Gap  (EBG)  Structures 

Y.  Rahmat-Saraii 

Electrical  Engineering  Department 
University  of  California  at  Los  Angeles 
Los  Angeles,  CA,  90095-1594,  U.  S.  A. 
rahmat@ee.ucla.edu 


1.  INTRODUCTION 

Artists  and  scientists  alike  have  been  fascinated  by 
the  existence  of  periodic  structures  in  nature. 
When  these  structures  interact  with 
electromagnetic  waves  many  unique  features 
result.  Observables  are  characteristics  such  as 
frequency  stop-bands,  pass-bands,  and  band-gaps. 
Various  terminology  have  been  used  to  classify 
these  structures  depending  on  the  domain  of  the 
applications  in  filter  designs,  gratings,  frequency 
selective  surfaces  (FSS),  photonic  crystals  and 
band-gaps  (PEG),  etc.  We  prefer  to  classify  them 
under  the  broad  terminology  of  “Electromagnetic 
Band-gaps  (EBG)”.  Recently,  many  researchers 
have  adopted  this  terminology.  Broadly  speaking, 
EBG  structures  are  3-D  periodic  objects  that 
prevent  the  propagation  of  the  electromagnetic 
waves  in  a  specified  band  of  frequency  for  all 
angles  of  arrival  and  for  all  polarization  states  of 
electromagnetic  waves.  In  practice,  however,  it  is 
very  difficult  to  obtain  such  complete  band-gap 
structures  and  partial  band-gaps  are  achieved. 
Filters  typically  cover  the  scalar  situation  and 
single  angle  of  arrival.  FSS  typically  cover  limited 
angles  of  arrival  and  respond  differently  to 
polarization  states.  PBG  typically  cover  in-plane 
angles  of  arrival  and  also  sensitive  to  polarization 
states.  Surveying  the  literature,  one  finds  that  FSS 
terminology  has  been  widely  used  in  the 
microwave  community  while  PBG  terminology 
has  been  widely  applied  in  the  optical  community. 
This  overview  paper  presents  a  powerful 
computational  engine  utilizing  Finite  Difference 
Time  Domain  (FDTD)  technique  integrated  with 


novel  extrapolating  algorithms  to  illustrate  the 
marvels  of  EBG  structures.  The  paper  addresses 
structures  such  as  (a)  FSS  structures,  (b)  PBG 
crystals,  (c)  smart  surfaces  for  communication 
antenna  applications,  (d)  surfaces  with  perfectly 
magnetic  conducting  properties  (PMC),  (e) 
creation  of  materials  with  negative  permittivity 
and  negative  permeability,  (f)  surfaces  with 
reduced  edge  diffraction  effects  and  (g)  reduction 
of  mutual  coupling  among  array  antenna 
elements.  Some  representative  applications  of 
these  structures  are  highlighted.  In  the  last  several 
years,  there  have  been  numerous  published 
conference  papers  and  journal  articles  dealing  with 
the  characterizations  and  applications  of  EBG 
structures.  This  paper  is  based  on  some  of  the 
results  published  by  the  author  and  his  co-workers 
in  the  cited  references.  The  reader  is  encouraged  to 
perform  detailed  literature  search  to  leam  more 
about  this  area. 

2.  UCLA-FDTD  CHARACTERIZATION  OF 
EBG  STRUCTURES 

To  perform  an  in-depth  assessment  of  the 
performance  characteristics  of  the  complex  EBG 
structures  the  FDTD  technique  with  Periodic 
Boundary  Condition/Perfectly  Matched  Layer 
(PBC/PML)  is  developed  by  modifying  previous¬ 
ly  developed  and  well  tested  FDTD  code  without 
periodic  boundary  conditions,  Jensen  and 
Rahmat-Samii  [1].  The  split-field  approach 
detailed  in  the  book  by  Taflove  [2],  is 
incorporated  to  discretize  the  Floquet  transformed 
Maxwell’s  equations,  Mosallaei  and  Rahmat- 
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Samii  [3].  The  broadband  analysis  capabilities  of 
the  FDTD  technique  provide  great  computational 
efficiency  and  accuracy  when  one  requires 
detennining  the  frequency  response  of  complex 
structures.  Extrapolation  schemes  such  as  Prony 
method  (Mosallaei  and  Rahmat-Samii  [4])  and 
Auto  Regressive  Moving  Average  (ARMA) 
method  (Yang  and  Rahmat-Samii  [5])  are  also 
incorporated  to  increase  the  efficiency  of  the 
computational  technique.  The  key  components  of 
this  powerful  computational  engine  are  briefed  in 
Fig.  1. 

The  developed  FDTD/Prony/ARMA  technique 
has  been  effectively  applied  to  the 
characterization  of  a  variety  of  complex  periodic 
EBG  structures  illustrated  in  Fig.  2. 
Representative  examples  are  discussed  to  highlight 
unique  features  of  the  results  and  application  areas. 

2.  ULTRA  BROADBAND  MULTI-LAYERED 
TRIPOD  FSS 

This  section  focuses  on  the  performance 
evaluation  of  the  electromagnetic  band-gap  multi¬ 
layered  tripod  FSS  (Barlevy  and  Rahmat-Samii 
[6])  with  ultra  wideband  characteristics.  Fig.  3(a) 
shows  the  geometry  of  a  single  layer  tripod  FSS. 
The  FDTD/Prony  technique  is  applied  to  analyze 
the  structure,  and  the  results  for  the  normal 
incidence  (^J  reflected  power  is  presented  in 
Fig.  3(b).  This  structure  exhibits  resonance 
behavior  around  /„  =  145  GHz . 

The  resonance  and  bandwidth  frequency  of  this 
structure  can  be  controlled  using  a  2-layer  tripod 


FSS,  as  depicted  in  Fig.  4(a).  The  second  layer  is 
rotated  180"  with  respect  to  the  first  layer,  and  is 
shifted  along  the  z-axis  in  such  a  way  that  all 
three  legs  of  each  tripod  overlap  a  leg  of  a  tripod 
in  the  first  layer.  The  overlap  region  creates  a 
capacitor,  which  is  used  to  tune  the  frequency  for 
100%  reflection.  The  reflected  power  of  the 
normal  incident  plane  wave  is  presented  in  Fig. 
4(b)  showing  that  for  the  2-layer  tripod  the 
resonance  frequency  is  shifted  down. 

To  further  broaden  the  100%  reflection 
bandwidth,  a  4-layer  tripod  FSS  is  introduced  in 
Fig.  5(a).  This  is  a  rather  complex  structure 
composed  of  two  sets  of  the  FSS  shown  in  Fig. 
4(a).  The  geometry  allows  two  degrees  of 
freedom,  d  and  D .  The  capacitance  governed  by 
the  small  distance  d  controls  the  lower  edge  of 
the  rejection  band,  where  the  large  inter-capacitor 
spacing  D  controls  the  upper  edge  of  the  band. 
The  reflected  power  for  both  the  normal  and  30‘ 
oblique  incidence  {O'  =9Q’,<f)'  =150")  for  TE  (E.) 
and  TM  (H^)  polarization  cases  are  presented  in 
Fig.  5(b).  One  notices  that  by  increasing  the 
bandwidth  of  the  100%  reflection,  an  EBG 
structure  utilizing  the  multiple  coupled  tripod 
arrays  is  designed.  For  the  angles  near  to  the 
grazing,  the  TM  waves  are  almost  nonnal  to 
surface  of  tripods  and  the  FSS  cannot  reject  them. 
Incorporating  interconnecting  vias  between  tripods 
helps  to  reject  the  normal  components.  This  design 
approaches  a  complete  band-gap  structure  as 
demonstrated  by  Barlevy  and  Rahmat-Samii  [6]. 


Fig.  1 :  Schematic  of  the  UCLA-FDTD/Prony/ARMA  computational  technique. 
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Dichroic  Plate 


Rectangular  PBG 


Sierpinski  Fractal  FSS 


Triangular  PBG 


Composite  Periodic  Material  with 
Negatives  &// 


Woodpile  PBG 


Multi-Layer  Tripod  FSS 


Fig.  2:  Different  classes  of  complex  EBG  structures  characterized  by  the  FDTD/Prony/ARMA 
technique. 
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Fig.  3:  1  “Layer  tripod  FSS,  (a)  Periodic  structure,  (b)  Normal  incident  reflected  power. 
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Fig.  4:  2-Layer  tripod  FSS,  (a)  Periodic  structure,  (b)  Normal  incident  reflected  power. 


Fig.5:  4-Layer  tripod  FSS,  (a)  Periodic  structure,  (b)  Normal  and  oblique  (TE/TM)  incidence 
reflected  power. 


3.  PBG  CRYSTALS  FOR  OPTICAL 
APPLICATIONS 

This  section  addresses  utilization  of  EBG 
structures  in  optical  regime  and  classifies  them  as 
the  photonic  band-gap  (PBG)  crystals  (Coccioli  et 
al.  [7]).  The  UCLA-FDTD  technique  is  applied  to 
obtain  the  reflection  coefficient  of  the  plane  wave 
incident  on  various  PBG  structures.  The  plane 
wave  approach  has  some  distinct  advantageous 
compared  to  the  dispersion  diagram  method  as 
listed  below: 


•  Presenting  phase  and  polarization  information 
of  the  scattered  fields, 

•  Obtaining  reflection  and  transmission 
coefficients  outside  the  band-gap  regime, 

•  Ease  of  implementation. 

For  a  5-layer  2-D  rectangular  PBG  structure  of  air 
holes  in  the  dielectric  material  and  for  both  nonnal 
and  oblique  incidence  plane  waves,  Fig.  6  shows 
the  reflection  coefficients.  There  is  a  complete 
band-gap  region  for  the  z-polarized  (TE)  waves 
for  the  normalized  frequencies  0.21  <a//l„  ^0.28. 
The  region  of  the  band-gap  frequency  band  has  an 
excellent  agreement  with  the  data  presented  in 
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Fig.6:  5-Layer  rectangular  PBG,  (a)  Periodic  structure,  (b)  Normal  and  oblique 
incidence  reflection  coefficient. 


Fig.7:  5-Layer  triangular  PBG,  (a)  Periodic  structure,  (b)  Normal  and  oblique  incidence 
reflection  coefficient. 


Joannopoulos  et  al  [8]  based  on  the  dispersion 
diagram.  The  PBG  stracture  can  be  also  created  to 
generate  a  TM  band-gap  region  using  a  triangular 
array  of  holes  (5-layer)  in  the  dielectric  material,  as 
shown  in  Fig.  7.  The  electric  field  is  polarized  in 
the  plane  normal  to  the  axis  of  holes.  One  observes 
that  there  is  a  TM  band-gap  region  in  the 
normalized  frequency  range  0.20  s  ^  0.25 .  Ho 
et  al  [9]  proposed  a  novel  PBG  structure  that  has 
the  potential  of  generating  a  complete  band-gap 
region  for  all  angles  of  incidence  and  for  all 
polarizations  (an  ideal  EBG  stmcture).  The 
geometry  of  a  2-layer  woodpile  PBG  is  depicted  in 
Fig.  8(a).  The  symmetric  arrangement  of  the 
structure  forbids  propagation  in  almost  all  the 
wave  vector  directions.  Fig.  8(b)  presents  the 


reflection  coefficient  for  the  normal,  30" 
oblique/TE,  and  arbitrary  incident  wave 
=40‘,(i' =150'  with  45'  polarization  angle 
(between  the  electric  field  and  reference  direction 
k'xz).  As  observed,  the  structure  is  able  to 
generate  an  almost  complete  band-gap  region. 
The  reflected  phase  on  the  surface  of  the  structure 
for  30'  oblique  wave  in  obtained  in  Fig.  8(c).  It  is 
interesting  to  note  that  within  the  band  gap  region 
the  phase  has  an  almost  linear  frequency  variation. 
This  means  that  the  woodpile  can  be  represented  as 
a  Perfect  Electric  Conductor  (PEC)  where  the 
location  of  the  PEC  is  variable  instead  to  be  fix  at 
the  front  surface  of  PBG. 
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Fig. 8:  2-Layer  woodpile  PBG,  (a)  Periodic  structure,  (b)  Normal  and  oblique  incidence  reflection 
coefficient,  (c)  Reflection  phase  for  the  30’  /TE  incidence  wave. 


4.  VARIOUS  APPLICATIONS  OF  EBG  the  gap  region)  becomes  localized  in  the 

transverse  plane  utilizing  the  PBG  crystal.  In  the 
One  may  wonder  how  some  of  those  novel  vertical  direction,  the  dielectric  contrast  between 

characteristics  of  EBG  structures  can  be  used  in  the  impurity  and  outside  air  region  produces  the 

engineering  designs.  This  section  discusses  total  internal  reflections  trapping  the  waves  in  this 
utilization  of  the  EBG  structures  into  some  direction  (Coccioli  et  al.  [7]). 

potential  applications  such  as  nanocavities, 

waveguides,  and  patch  antennas,  as  shown  in  Fig.  4.2  Guiding  the  light  in  sharp  bends.  Another 
9.  Some  practical  antenna  examples  using  EBG  useful  application  of  the  PBG  is  for  guiding  the 
structures  have  also  been  fabricated  and  tested.  light  in  sharp  bends.  This  is  achieved  by  an  array 

4.1  High  Q  nanocavities.  To  create  a  high  Q  of  the  PBG  holes  removed  in  the  guiding 

cavity  for  distributed  laser  applications  a  finite  direction  as  shown  in  Fig.  9(b).  In  the  frequency 
thickness  2-D  PBG  structure  is  used  to  localize  range  within  the  gap-region,  light  is  confined 

the  electromagnetic  waves  inside  a  defect  region  through  the  channel,  and  cannot  be  scattered 

in  three  directions.  The  confinement  is  based  on  through  the  PBG.  This  is  even  the  case  at  tight 

the  PBG  gap/total  internal  reflections  properties  comers.  It  has  been  shown  that  by  appropriate 

of  the  stmcture.  The  defect-excited  mode  (inside  shaping  of  the  bend  one  can  reduce  scattering  of 


^  E  tdB)  E  (dB) 

(a)  (b)  (c) 

Fig. 9;  Potential  applications  of  the  PBG  structures  in  (a)  High  Q  nanocavities,  (b)  Guiding  the  EM 
waves  in  sharp  bends,  and  (c)  Miniaturized  microstrip  patch  antennas. 
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the  light  waves  at  sharp  comers. 

4.3  Miniaturized  microstrip  patch  antennas. 
Miniaturized  patch  antennas  can  be  constracted 
using  the  EBG  as  the  substrate  surrounding  the 
patch  as  shown  in  Fig.  9(c).  This  results  into 
suppression  of  the  surface  waves.  Although,  in 
general,  this  EBG  stracture  cannot  generate  a 
complete  surface  wave  band-gap,  it  still  has  the 
capability  of  suppressing  surface  waves,  and 
improving  the  radiation  performance  of  the  patch 
antenna  (Colburn  and  Rahmat-Samii  [10]). 

4.4  Low  profile  CP  curl  antenna  design.  The 

curl  antenna  was  originally  proposed  as  a  simple 
radiator  to  generate  circular  polarized 
electromagnetic  waves.  This  antenna  concept 
does  not  function  efficiently  when  it  is  placed 
close  to  a  finite  PEC  ground  plane  due  to  the 


reverse  image.  Therefore,  the  mushroom  EBG 
surface,  which  shows  an  in-phase  reflection 
feature  inside  its  band  gap,  provides  a  good 
alternate  to  the  PEC  ground  plane  to  build  a  low 
profile  structure  as  well  as  the  circular 
polarization  (Yang  and  Rahmat-Samii  [11]). 

Fig.  10(a)  shows  a  photo  of  a  curl  antenna 
designed  by  the  UCLA-FDTD  computational 
tools.  The  height  of  the  antenna  is  only  3  mm, 
which  is  about  0.07  ^  at  7  GHz.  Compared  to  a 
curl  on  the  PEC  ground  plane,  the  curl  on  the 
EBG  ground  plane  has  a  much  better  input  match 
as  shown  in  Fig.  10(b).  The  axial  ratio  of  the 
antenna  is  also  measured  and  plotted  in  Fig. 
10(c).  A  good  axial  ratio  of  0.9  dB  is  achieved  at 
7.18  GHz. 
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Fig.  10:  A  low  profile  circularly  polarized  curl  antenna  on  a  finite  mushroom  EBG  surface:  (a)  a  photo 
of  the  design,  (b)  return  loss  of  the  curl  on  the  EBG  compared  to  that  of  a  curl  on  a  PEC  surface,  and 
(c)  axial  ratio  (AR)  of  the  curl  anterma. 


4.5  Microstrip  antenna  element  within  an  EBG 
for  enhanced  performance.  Applications  of 
microstrip  antennas  on  high  dielectric  constant 
substrates  are  of  growing  interest  due  to  their 
compact  size  and  conformability  with  monolithic 
microwave  integrated  circuit  (MMIC).  However, 
there  are  some  drawbacks  with  the  utilization  of 
high  dielectric  constant  substrates  such  as  a 
narrow  bandwidth  and  pronounced  surface  waves. 
The  bandwidth  can  be  recovered  using  a  thick 
substrate;  however,  this  excites  severe  surface 
waves,  which  will  decrease  the  antenna 


efficiency,  degrade  the  antenna  pattern  and 
increase  the  mutual  coupling.  Because  the 
mushroom  EBG  structure  exhibits  a  surface- wave 
suppression  feature,  it  is  applied  to  the  patch 
antenna  and  array  to  improve  their  performance. 

Fig.  11(a)  shows  a  photo  of  a  microstrip  antenna 
surrounded  by  four  rows  of  EBG  patches  and  Fig. 
11(b)  presents  the  E  plane  radiation  pattern  of  the 
antenna  compared  to  other  conventional 
microstrip  antenna  designs  (Yang  and  Rahmat- 
Samii  [12]).  It  is  observed  that  the  radiation 
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Fig.  1 1 :  Enhanced  perfonnance  microstrip  antenna  with  EBG  around:  (a)  a  photo  of  the  design  and 
(b)  measured  radiation  patterns  of  various  microstrip  anteiuia  designs.  It  is  noticed  that  the  microstrip 
antenna  with  EBG  around  has  the  highest  gain  and  lowest  back  lobe. 


performance  of  the  antenna  with  EBG  around  is 
the  best:  its  front  radiation  is  the  highest  that  is 
about  3  dB  higher  than  the  thick  case  while  its 

4.6  Microstrip  antenna  array  with  EBG  in 
between  for  reduced  mutual  coupling.  Above 
experiments  have  demonstrated  that  the  EBG 
structure  can  suppress  surface  waves  successfully 
so  that  the  performance  of  an  antenna  element  is 
significantly  improved.  This  approach  has  also 
been  extended  to  array  applications  (Yang  and 
Rahmat-Samii  [13]).  Two  pairs  of  microstrip 
antenna  arrays  were  fabricated  on  Roger 
RT/Duroid  6010  substrates.  Fig.  12(a)  shows  a 
photograph  of  the  fabricated  antenna  arrays  with 
and  without  EBG  in  between.  The  antenna  size 


back  lobe  is  the  lowest  which  is  more  than  1 5  dB 
lower  than  other  cases. 

and  EBG  parameters  are  carefully  designed  by  the 
UCLA-FDTD  tools.  The  measured  results  are 
plotted  in  Fig.  12(b).  It  is  noticed  that  both 
antennas  resonate  at  5.86  GHz  with  impedance 
matches  better  than  -10  dB.  For  the  antennas 
without  the  EBG  structure,  the  mutual  coupling  at 
5.86  GHz  is  -16.8  dB.  As  a  comparison,  the 
mutual  coupling  of  the  antennas  with  the  EBG  in 
between  is  only  -24.6  dB.  About  8  dB  reduction 
of  mutual  coupling  is  achieved  at  the  resonant 
frequency.  From  this  experimental  demonstration, 
it  can  be  concluded  that  the  EBG  structure  can  be 


Fig.  12:  Microstrip  antenna  array  with  reduced  mutual  coupling:  (a)  photos  of  microstrip  antenna 
arrays  with  and  without  EBG  in  between  and  (b)  measured  return  loss  and  mutual  coupling  results  of 
the  arrays.  It  is  observed  that  the  mutual  coupling  is  greatly  reduced  when  the  EBG  is  inserted 
between  patch  elements. 
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utilized  to  reduce  the  mutual  coupling  between 
elements  of  antenna  arrays. 

5.  CONCLUSIONS 

This  invited  plenary  session  paper  provides  an 
overview  of  research  activities  at  author’s 
laboratory  at  UCLA  in  the  area  of  EBG  structures. 
As  such,  it  is  based  on  some  of  the  results 
published  by  the  author  and  his  co-workers  in  the 
cited  references.  Computational  tools  of 
FDTD/Prony/AMRA  are  discussed.  Ample 
representative  examples  are  provided  to  highlight 
the  marvels  of  EBG  stmctures  for  both  microwave 
and  optical  applications.  Characterizations  of  EBG 
structures  and  related  meta-materials  are 
interesting  research  frontiers  with  potential 
electromagnetic  engineering  system  applications. 
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A  Decompose-Solve-Recompose  (DSR)  Technique 
for  Large  Phased  Array  Antenna  Analysis 

K.  Y.  Sze',  K.  F.  Sabet^  and  D.  Chun^ 


Abstract:  A  novel  spatial  Decompose-Solve-Recompose 
(DSR)  technique  is  demonstrated  to  be  very  attractive 
for  analyzing  uniform  and  non-uniform  large  phased 
array  (LPA)  antennas,  because  it  can  accurately  ac¬ 
count  for  array  edge  effects.  A  simple  concurrent  peri- 
odic/non-periodic  analysis  scheme,  similar  to  that  util¬ 
ized  in  the  Progressive  Numerical  Method  (PNM),  is 
presented  for  the  modeling  of  planar  large  phased  ar¬ 
ray  antennas.  The  resulting  2D  spatial  DSR  technique, 
known  as  the  Hybrid  Edge-Periodic  DSR  technique, 
requires  the  decomposition  of  a  large  planar  array  into 
an  outer  edge  “ring”  array  and  a  central  periodic  array 
block. 

Index  Terms:  spatial  DSR,  large  phased  array  (LPA) 
antenna,  PNM,  periodic,  non-periodic.  Hybrid  Edge- 
Periodic,  uniform  array,  non-uniform  array. 


1.  INTRODUCTION 

The  full-wave  analysis  of  large-scale  phased  array  sys¬ 
tems  poses  a  very  challenging  computational  electromag¬ 
netic  problem.  Conventional  full-wave  techniques  such  as 
the  Method  of  Moments  (MoM)  can  handle  small-  to  me¬ 
dium-scale  problems  relatively  easily.  When  the  size  of  the 
array  exceeds  a  hundred  elements,  full-wave  techniques 
reach  their  limit  of  applicability.  For  larger  arrays,  periodic 
simulators  are  often  utilized,  whereby  the  array  is  assumed 
to  have  an  infinite  extent.  However,  periodic  techniques 
cannot  predict  edge  effects  due  to  the  radiating  elements 
located  at  the  boundary  of  the  finite-size  array  structure. 
Therefore,  it  is  essential  to  develop  a  technique  that  utilizes 
the  full-wave  analysis  of  the  array  in  an  efficient  manner 
while  being  able  to  recognize  the  finite  size  of  the  array 
and  account  for  the  edge  effects. 

Several  techniques  are  presently  available  in  the  litera¬ 
ture  on  the  analysis  and  design  of  large  phased  arrays.  The 
truncated  Floquet  Wave/GTD  formulation  [1],  [2]  utilizes 
a  Floquet  mode  truncation  method  to  model  a  plane  wave 
illumination  of  a  large  array  of  dipole  elements  in  conjunc¬ 
tion  with  the  GTD  technique  to  account  for  edge  element 


diffractions.  This  approach  was  also  extended  to  include  a 
mildly  tapered  plane  wave  illumination  of  the  dipole  array 
[3].  Another  new  hybrid  technique,  the  Discrete  Fourier 
Transform/Moment  Method  (DFT-MoM)  [4],  also  incor¬ 
porates  the  high  frequency  GTD  analysis  to  include  edge 
diffractions.  Additionally,  for  a  large  scatterer  analysis,  a 
relatively  similar  technique  used  is  one  that  is  based  on 
MoM  and  combined  with  a  new  asymptotic  formulation 
known  as  the  asymptotic  phase-front  extraction  (APE)  [5]. 
This  technique  utilizes  results  from  low  frequency  simula¬ 
tions  to  predict  solutions  at  higher  frequencies,  so  that 
computational  effort  and  memory  requirements  are  signifi¬ 
cantly  reduced.  Nevertheless,  all  these  asymptotic  tech¬ 
niques  are  generally  very  complex  and  are  presently  appli¬ 
cable  only  to  simple  geometries.  In  addition,  a  somewhat 
new  matrix  decomposition  technique  was  introduced  in 
[6],  using  the  Generalized  Forward-Backward  Method 
(GFBM),  in  which  the  global  impedance  matrix  is  decom¬ 
posed  into  forward  and  backward  components  instead  of 
the  submatrices.  Although  this  is  proven  to  be  accurate  and 
efficient  for  rough  surface  scattering  problems,  further 
studies  are  necessaiy  to  confirm  its  accuracy,  efficiency 
and  robustness  applicable  to  large  phased  array  analyses. 

On  the  issue  of  mutual  coupling  in  a  non-uniform 
(aperiodic)  array,  papers  [1 1]“[14]  discussed  some  analysis 
methods  using  periodic  sources  for  modeling  a  single 
source  in  an  otherwise  large  uniform  array,  which  is  a  sin¬ 
gly-perturbed  non-uniform  array  problem.  Nevertheless, 
there  is  still  a  great  demand  for  a  more  generalized  method 
that  handles  a  multiply-perturbed  non-uniform  array  prob¬ 
lem,  and  this  is  thus  the  focus  of  this  paper. 

In  this  paper,  a  brief  overview  is  given  of  a  proposed 
simpler  concurrent  periodic/non-periodic  analysis  scheme, 
the  Decompose-Solve-Recompose  (DSR)  technique  [7], 
adapted  to  the  modeling  of  planar  large  phased  array 
(LPA)  systems.  The  resulting  2D  spatial  DSR  technique, 
known  as  the  Hybrid  Edge-Periodic  DSR  technique,  re¬ 
quires  the  decomposition  of  a  large  planar  array  into  an 
outer  edge  “ring”  array  and  a  central  periodic  array  block. 
In  addition,  its  computation  speed  and  efficiency  may  be 
further  enhanced  by  means  of  a  2D  Progressive  Numerical 
Method  (PNM)  like  algorithm  described  in  [8]-[10].  An 
analysis  using  the  Hybrid  Edge-Periodic  DSR  technique  is 
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presented  for  uniform  and  non-uniform  LPA  examples, 
similar  to  that  for  the  uniform  12x1 2-element  LPA  re¬ 
ported  in  [7].  These  studies  are  part  of  an  effort  to  under¬ 
stand  the  characteristics  of  the  Hybrid  Edge-Periodic  DSR 
technique  for  applications  to  more  general  uniform  and 
non-uniform  LPA  analyses  and  designs.  Traditional  ap¬ 
proaches,  such  as,  that  computed  by  a  brute  force  Method 
of  Moments  (MoM)  technique,  and  a  simpler  approxima¬ 
tion  approach  using  the  periodic  array  windowing  ap¬ 
proach,  are  employed  for  comparisons. 


II.  HYBRID  EDGE-PERIODIC  DSR 
TECHNIQUE 

A  2D  spatial  DSR  analysis,  using  theHy  brid  Edge- 
Periodic  DSR  technique,  is  employed  for  the  modeling  of  a 
planar  array  of  dipoles  depicted  schematically  in  Fig.  1. 
This  DSR  technique  is  new,  and  involves  the 
decomposition  of  an  LPA  into  an  outer  edge  “ring”  array 
and  a  central  block  of  periodic  array,  as  shown  int  he 
figure.  Each  of  these  decomposed  arrays  are  solved 
independently  using  the  full-wave  MoM  (or  any  other  full- 
wave  analysis  methods),  and  subsequently,  recomposed 
back  as  a  solution  to  the  original  problem. 
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Fig.  I:  Discarding  an  edge  element  ring  for  an  8x8  planar 
array  using  the  Hybrid  Edge-Periodic  DSR  technique.  The 
original  edge  element  ring  cluster  (top  left)  is  2  rings  wide, 
and  with  the  second  ring  in  the  cluster  discarded  (i.e.  over¬ 
lapped  by  the  periodic  element  cluster),  only  the  first  ring 
is  retained  (bottom  right). 

Additional  improvements  of  the  Hybrid  Edge-Periodic 
DSR  technique  may  be  achieved  through  the  use  of  region 
“overlapping”  between  edge  rings  and  the  periodic  array 
block,  as  implemented  similarly  in  a  PNM  algorithm  in  [8]. 
An  optimal  choice  of  edge  element  ring  width  can  also 
yield  better  accuracy.  The  mechanism  of  region  “overlap¬ 


ping”  requires  that  inner  edge  rings  be  discarded  and  outer 
rings  retained  during  the  recomposition  of  solution.  Peri¬ 
odic  elements  are  then  substituted  in  their  place  so  that  the 
final  solution  will  still  represent  the  correct  number  of  ar¬ 
ray  elements  and  their  spatial  positions  in  Euclidean  space, 
as  illustrated  in  Fig.  1 .  That  is, 

Total  Rings  =  Rings  Retained  +  Rings  Discarded.  (1) 

These  discarded  rings  actually  served  as  “pawns”  for  ap¬ 
proximating  the  mutual  coupling  effects  on  the  rings  re¬ 
tained. 

Fig.  2  shows  am  atrix  block  representation  of  a  de¬ 
composed  full-wave  MoM  impedance  matrix,  consisting  of 
different  submatrix  blocks.  These  blocks  are  [  Aa  ],  [  A^  ], 
[  Ac  ],  [  Actr  ]  and  [  Aedg  ],  which  respectively  represent 
couplings  (interactions  between  basis  and  test  functions) 
between  elements  in  the  overlap  and  edge  ring  regions, 
overlap  and  central  block  regions,  overlap  region  only, 
central  block  region  only,  and  edge  ring  region  only. 


Comhined  matrix  system  for  the  whole  problem  domain 


Fig.  2:  Decomposed  matrix  blocks  of  the  full-wave  MoM 
impedance  matrix. 

However,  this  combined  matrix  system  is  not  directly 
solved  as  a  single  matrix  system,  but  rather,  as  two  sepa¬ 
rate  smaller  matrix  problems.  In  particular,  one  solution  is 
computed  for  the  central  block  of  periodic  array,  and  an¬ 
other  for  the  edge  ring  array.  Mathematically,  this  method 
may  also  be  considered  as  a  form  of  matrix 
decomposition  technique,  with  its  methodology  based  on 
physical  2D  spatial  decomposition.  Further  detailed 
mathematical  formulations  are  found  in  [9],  [10]. 

For  the  central  block  region,  with  the  solution  (current) 
vector  assumed  as  [  Cetr  voltage  vector  rep¬ 

resented  as  [  betr  ]  *,  the  submatrix  equation  becomes 
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[  ^ctr  ]  [  ^ctr  ]  ]  [  ^ctr]^  (2) 

[  “^b  ]  [  ^ctr  ]“^[-A.c][cb]  =  [  be],  (3) 

Similarly,  for  the  edge  ring  region,  with  solution  (current) 

vector  [  Ca  Cedg  ]'  and  voltage  vector  [  be  bedg  ]  ,  tbe 
submatrix  equation  becomes 

[Ac][Ca]  +  [Aa][Cedg]  =  [bcL  (4) 

[  Aa  ]  [  Ca  ]  +  [  Aedg  ]  [  Cedg  ]  ~  [  bedg]-  (5) 

Subsequently,  by  recomposing  the  solution  vectors  of  (2) 
through  (5)  into  a  new  single  solution  vector,  which  be¬ 
comes  [  Cetr  Cb  Cedg  J  ^  with  the  sub  vector  [  Ca  ]  dis¬ 
carded. 

For  the  modeling  of  an  LPA  on  a  platform  in  the  vicin¬ 
ity  of  objects  such  as  screws,  fasteners  and  pins,  as  sche¬ 
matically  depicted  in  Fig.  3,  the  Hybrid  Edge-Periodic  DSR 
technique  can  be  employed,  with  additional  considerations 
for  adjacent  objects  to  be  solved  as  part  of  the  edge  element 
array  in  the  DSR  algorithm.  For  the  ease  of  developing  the 
DSR  technique,  however,  uniform  and  non-uniform  LPAs 
are  utilized  as  simple  test  examples  in  the  proving  of  con¬ 
cepts  in  this  paper,  since  their  radiation  behaviors  are  gen- 
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Fig.  3:  Schematic  of  a  Hybrid  Edge-Periodic  DSR  model 
for  the  analysis  of  an  LPA  in  the  vicinity  of  other  objects. 


III.  LARGE  PHASED  ARRAY  MODELS 

In  this  analysis,  a  uniformly-excited  8x8-element  array 
of  microstrip  dipoles  etched  on  a  foam  substrate  (8r=1.03) 
of  thickness  0.1 9Xo  is  employed,  with  dipole  length  and 
width  being  0.39Xo  and  0.002Xo,  respectively,  and  element 
spacings  in  the  x-  and  y-directions  being  0.5A,o  and 
0,333Xo,  respectively.  The  accuracy  of  this  spatial  DSR 


technique  over  the  traditional  periodic  array  windowing 
approach  is  also  investigated  for  a  24x24-element  uniform 
array  of  microstrip  dipoles  etched  on  a  8r=2.2  substrate  of 
thickness  0.188Xd,  where  The  array  dipoles  are 

center-fed,  each  having  a  length  and  width  of  0.578A.d  and 
0.003Xd,  respectively,  and  their  center-to-center  element 
spacings  in  the  x-  and  y-directions  are  0.742Xd  and 
0.494Xd,  respectively.  These  dipoles  are  oriented  parallel  to 
the  x-axis,  giving  an  Ex  field  polarization.  The  full-wave 
MoM  solutions  are  computed  using  E'A/PiCASSO,  a  well- 
established  commercial  EM  CAD  software  tool  from 
EMAG  Technologies,  Inc. 

For  the  24x24-element  array,  the  full-matrix  solution  is 
equivalent  to  the  case  having  a  total  of  12  square  rings  with 
no  rings  discarded  (i.e.  with  no  periodic  elementu  tilized  in 
the  DSR  modeling),  while  the  periodic  array  windowing 
solution  is  equivalent  to  that  without  any  rings  (i.e.  with 
only  periodic  elements  utilized  in  the  DSR  simulation).  For 
example,  a  zero  number  of  rings  corresponds  to  a  win¬ 
dowed  periodic  array  solution.  For  a  total  number  of  rings 
between  these  two  extremes,  results  obtained  are  from 
combinations  of  solutions  for  both  edge  rings  and  inner 
periodic  elements.  The  amount  of  region  overlap  is  thus 
implicitly  represented  by  the  number  of  rings  discarded. 

Extending  the  modeling  to  a  non-uniform  LPA,  the 
uniform  24x24-element  LPA  is  subsequently  modified  to 
consist  of  48  cross-polarized  dipoles  arranged  alternately  at 
the  array  edge.  A  similar  DSR  procedure  is  then  utilized  for 
this  non-uniform  case. 

IV.  FAR-FIELD  RADIATION 
CHARACTERISTICS 

Far-field  radiation  characteristics  for  the  uniform  8x8- 
element  LPA  are  computed  using  the  full-matrix  (full-wave 
exact  solution),  periodic  array  windowing  and  Hybrid 
Edge-Periodic  DSR  (using  2  and  3  edge  element  rings 
without  any  region  overlap)  techniques.  Thus,  directivities 
computed  for  this  LPA  are  21.670dBi,  21.590dBi, 
21.644dBi  and  21.671dBi,  respectively.  These  techniques 
all  exhibit  good  accuracy,  with  the  3 -element  ring  Hybrid 
Edge-Periodic  approach  producing  the  best  accuracy.  Fig.  4 
illustrates  the  far-field  radiation  patterns  of  this  LPA.  The 
Hybrid  Edge-Periodic  technique  with  both  2-  and  3- 
element  rings  shows  excellent  agreement  with  the  full- 
matrix  solution.  Although  results  for  the  periodic  array 
windowing  technique  indicate  relatively  good  accuracy  for 
near  broadside  observation  angles,  its  predictions  of  side- 
lobe  levels  (SLL)  at  far  observation  angles  incur  large  er¬ 
rors.  The  Hybrid  Edge-Periodic  technique,  on  the  other 
hand,  significantly  improves  this  discrepancy. 
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(a) 


Fig.  4:  Far-field  radiation  patterns  of  a  uniform  8x8- 
element  array  of  microstrip  dipoles  etched  on  a  foam  sub¬ 
strate  (€r=L03)  of  thickness  0.19Xo:  (a)  E-plane,  and  (b) 
H-plane.  Dipole  lengths  and  widths  are  0.39 Xq  and 
0.002 Xo,  respectively,  and  element  spacings  in  the  x-  and y- 
directions  are  O.SXo  and  0.333Xo,  respectively. 


Far-field  radiation  characteristics  for  the  uniform 
24x24-element  LPA  are  computed  using  the  same  full- 
matrix,  periodic  array  windowing  and  Hybrid  Edge- 
Periodic  DSR  techniques  described  above.  Their  far-field 
radiation  patterns  are  shown  in  Fig.  5,  and  their  corre¬ 
sponding  directivities  are  30.89dBi,  30.94dBi  and 

30.87dBi,  respectively.  For  the  Hybrid  Edge-Periodic  mod¬ 
eling,  radiation  patterns  are  obtained  using  a  total  of  7  edge 
element  rings  with  a  4-ring  overlap.  With  realistic  array 
edge  effect  incorporated  into  the  analysis,  this  model  pre¬ 


dicts  pattern  SLL  with  good  accuracy.  More  accurate  SLL 
may  be  obtained  through  the  use  of  an  optimal  choice  of 
the  number  of  edge  rings  and  overlapping.  For  the  periodic 
array  windowing  approach,  on  the  other  hand,  distinct  nulls 
are  predicted  which  are  especially  unrealistic  in  the  H- 
plane. 


(b) 

Fig.  5:  Far-field  radiation  patterns  of  a  uniform  24x24- 
element  array  of  microstrip  dipoles  etched  on  a  Sr=2.2 
substrate  of  thickness  O.J88Xd  (where  X^—  Xp/Vsr),  obtained 
using  different  techniques:  (a)  E-plane,  and  (b)  H-plane. 
Oriented  parallel  to  the  x-axis,  the  dipoles  have  lengths 
and  widths  0.578Xj  and  0.003 X^,  respectively,  and  element 
spacings  in  the  x-  and  y-directions  are  0.742X({  and 
0,494 X^,  respectively.  The  Hybrid  Edge-Periodic  results 
are  computed  using  a  total  of  7  edge  element  rings  with  a 
4-ring  overlap. 
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In  addition,  the  Hybrid  Edge-Periodic  DSR  technique 
is  capable  of  accurately  predicting  far-field  radiation  char¬ 
acteristics  of  a  non-uniform  24x24-element  LPA,  as  illus¬ 
trated  in  Fig.  6  and  Fig.  7.  For  this  case,  directivities  for  the 
full-matrix  and  Hybrid  Edge-Periodic  techniques  are 
30.48dBi  and  30.51dBi,  respectively,  and  their  cross- 
polarized  “main”  lobes  are  18.90dB  (for  full-matrix  ap¬ 
proach)  and  18.49dB  (for  Hybrid  Edge-Periodic  technique) 
below  their  co-polarized  counterparts,  respectively. 


Fig.  6:  Co-polarized  far-field  radiation  patterns  of  a  non- 
uniform  24x24-element  array  of  microstrip  dipoles:  (a)  E- 
plane,  (b)  H-plane.  Hybrid  Edge-Periodic  plots  are  non- 
optimal  results  computed  using  a  total  of  7  edge  element 
rings  with  a  4-ring  overlap,  while  all  other  array  parame¬ 
ters  are  the  same  as  in  Fig.  5. 


Fig.  7  also  demonstrates  that  very  accurate  cross¬ 
polarization  results  can  be  achieved  through  the  Hybrid 
Edge-Periodic  technique.  This  is  attributed  to  the  cross- 
polarized  fields,  which  are  contributed  only  by  the  y- 
directed  dipoles  at  the  array  edge,  as  being  solved  using  the 
full-wave  MoM  as  part  of  the  edge  ring  array,  and  that, 
there  is  no  coupling  of  these  y-directed  dipoles  with  ele¬ 
ments  beyond  a  6-element  distance.  Furthermore,  with  a 
proper  choice  of  the  number  of  edge  rings  and  overlapping 
utilized,  co-polarized  radiation  patterns  can  be  further  im¬ 
proved  as  well. 


Fig.  7:  Cross-polarized  far-field  radiation  patterns  of  a 
non-uniform  2 4x2 4-element  array  of  microstrip  dipoles: 
(a)  E-plane,  (b)  H-plane.  All  array  parameters  are  the 
same  as  in  Fig.  6. 
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V.  COMPUTATIONAL  PEPIFORMANCE 

CPU  times  and  memory  storage  requirements  for  simu¬ 
lations  of  the  uniform  12x12-  and  24x24-element  LPAs  are 
briefly  investigated  in  this  section.  More  rigorous  quantita¬ 
tive  studies  and  benchmarkings  of  these  are  necessary, 
which  will  be  addressed  separately. 

A  cost  function  analysis  using  the  Hybrid  Edge- 
Periodic  DSR  technique  is  also  presented  in  this  section  for 
the  non-uniform  24x24-element  LPA  described  in  Section 
IV.  A  similar  analysis  for  the  uniform  12x1 2-element  LPA 
was  reported  in  [7].  These  are  part  of  an  effort  to  determine 
the  accuracy  of  the  solutions  obtained  and  to  understand 
the  convergence  characteristics  of  the  DSR  technique. 

Table  1  compares  the  CPU  times  for  modeling  a  uni¬ 
form  12x1 2-element  LPA  using  the  Hybrid  Edge-Periodic 
DSR  technique,  with  23  mesh  segmentations  on  each  di¬ 
pole  element.  As  compared  to  that  of  the  full-matrix  MoM 
simulation  (full-wave  MoM  simulation),  the  number  of 
unknowns  required  using  this  technique  is  reduced  by  ap¬ 
proximately  25%,  and  its  CPU  time  is  decreased  by  more 
than  55%.  From  Table  2,  its  memory  storage  size  is  also 
decreased  by  more  than  40%.  Nonetheless,  the  conven¬ 
tional  periodic  array  windowing  technique  (which  does  not 
account  for  finite  array  edge  effects)  is  still  the  most  com¬ 
putationally  cost  efficient  technique  of  the  3  cases  investi¬ 
gated,  if  array  edge  effects  can  be  neglected. 


As  for  the  uniform  24x24-element  LPA,  modeled  us¬ 
ing  8  mesh  segmentations  per  dipole  element,  the  number 
of  unknowns  required  is  reduced  by  more  than  55%,  and 
more  than  15%,  for  the  case  of  3  and  7  edge  element  rings, 
respectively.  These  results  are  as  presented  in  Table  3. 
Their  memory  storage  sizes  are  also  decreased  by  more 
than  80%,  and  more  than  30%,  respectively  (also  shown  in 
Table  3). 

These  observations  of  results  from  Table  1  through 
Table  3  indicate,  that,  the  larger  the  array  size  is,  the 
greater  is  the  computational  cost  saving  in  terms  of  CPU 
time  and  memory  storage  requirements. 


Table  3:  Comparisons  of  memory  storage  sizes  times  and 
the  number  of  unknowns  required  for  simulating  a  uniform 
24x24-element  LPA  at  8  mesh  segmentations  per  dipole 
element 


Computational 

technique 

Number  of 
unknowns* 

Memory* 

(MB) 

Full-matrix  MoM 

4608  (100%) 

1590.9(100%) 

Hybrid  Edge-Periodic 
(3  edge  element  rings) 

2024  (43.92%) 

307.1  (19.30%) 

Hybrid  Edge-Periodic 
(7  edge  element  rings) 

3816(82.81%) 

1091.1 

(68.58%) 

*  Values  in  parentheses  are  relative  percentages  to  that  of  the  full-matrix 
MoM  simulation. 


Table  1:  Comparisons  of  CPU  times  and  the  number  of  un¬ 
knowns  required  for  simulating  a  uniform  12x1 2-element 
LPA  at  23  mesh  segmentations  per  dipole  element 


Computational 

technique 

Number  of 
unknowns* 

CPU  time* 
(minutes) 

Full -matrix  MoM 

3312(100%) 

187.8  (100%) 

Periodic  array 

23  (0.69%) 

0.02  (0.01%) 

Hybrid  Edge-Periodic 
(3  edge  element  rings, 
with  1 -ring  overlap) 

2507  (75.69%) 

80.3  (42.76%) 

*  Values  in  parentheses  are  relative  percentages  to  that  of  the  full-matrix 
MoM  simulation. 


Table  2:  Comparisons  of  memory  storage  sizes  for  simulat¬ 
ing  a  uniform  12x12-element  LPA  at  23  mesh  segmenta¬ 
tions  per  dipole  element 


Computational  technique  \ 

Memory*  (MB) 

Full-matrix  MoM 

822(100%) 

Periodic  array 

0.04  (0.005%) 

Hybrid  Edge-Periodic 
(3  edge  element  rings) 

471.1  (57.31%) 

*  Values  in  parentheses  are  relative  percentages  to  that  of  the  full-matrix 
MoM  simulation. 


For  the  non-uniform  24x24-element  LPA,  its  cost 
function  analysis  results  are  illustrated  in  Fig.  8.  A  full- 
matrix  solution  (full-wave  MoM  solution)  is  used  as  exact 
solutions  for  comparing  the  errors  in  directivity.  For  in¬ 
stance,  for  the  DSR  modeling  results  shown  in  Fig.  6,  the 
error  in  directivity  is  0.03dB,  which  can  be  further  im¬ 
proved  if  necessary.  Although  this  error  in  directivity  is 
relatively  small,  errors  in  its  far- field  radiation  patterns  can 
be  significant. 

The  directivity  sensitivities  to  total  number  of  rings 
used,  retained  and  discarded  (overlapped)  are  also  illus¬ 
trated  in  Fig.  8.  That  is,  the  directivity  becomes  less  sensi¬ 
tive  as  more  rings  are  retained  (or,  less  rings  are  dis¬ 
carded/overlapped),  and  the  total  number  of  rings  utilized 
is  increased.  However,  the  computational  cost  is  also  in¬ 
creased.  Nonetheless,  two  types  of  convergence  behaviors 
are  observed  in  the  figures.  Namely,  a  convergence  occurs 
for  an  increasing  total  number  of  rings  utilized,  and  an¬ 
other,  for  an  increasing  number  of  rings  retained  (or,  for  a 
decreasing  number  of  rings  discarded/overlapped).  The 
cost  function  curves  also  appear  to  indicate  a  convergence 
to  a  non-zero  dB  error,  which  is  mostly  attributed  to  nu¬ 
merical  round-off  errors  in  the  directivity  computations. 
Thus,  based  on  these  observations,  optimal  DSR  parame¬ 
ters  for  a  best  accuracy  at  minimal  cost  can  easily  be  ob¬ 
tained  from  data  in  the  figures. 
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The  cost  function  analysis  results  for  the  uniform 
24x24”element  LPA,  described  in  Section  IV,  can  also  be 
similarly  determined.  Nevertheless,  the  discarding  of  edge 
element  rings  is  generally  more  expensive  since  more  rings 
are  necessary  and  the  computational  cost  increases  with  the 
increasing  number  of  total  rings  (due  to  the  use  of  full- 
wave  technique  for  the  edge  array  computation). 


Total  Number  of  Rings 
(^) 


(b) 

Fig.  8:  Hybrid  Edge-Periodic  cost  function  curves  for  a 
non-uniform  24x24-element  array:  (a)  effect  of  increasing 
number  of  rings  retained,  and  (b)  effect  of  overlapping 
regions,  represented  as  rings  discarded  (where  Tot.R.  ^ 
Total  Rings),  for  a  particular  total  number  of  rings.  All 
other  array  parameters  are  the  same  as  in  Fig.  5. 


VI.  CONCLUSION 

A  region  overlap  mechanism,  similar  to  that  utilized  in 
PNM,  is  implemented  into  a  newly  proposed  Hybrid  Edge- 


Periodic  DSR  technique  for  the  2D  spatialDSR  analysis  of 
planar  LPA  systems.  Simulations  of  the  uniform  8x8-  and 
12x1 2-element  LPAs,  and  both  uniform  and  non-uniform 
24x24-element  LPAs,  provide  very  good  results  and  also 
demonstrate  high  computational  efficiency.  Although  the 
periodic  array  windowing  approach  produces  unrealistic 
distinct  nulls  in  its  far-field  radiation  patterns,  but  yields 
acceptable  accuracy  for  a  uniform  LPA,  the  Hybrid  Edge- 
Periodic  DSR  technique  has  proven  to  be  more  superior  for 
a  large-scale  non-uniform  LPA,  and  may  be  its  only  practi¬ 
cal  modeling  solution.  In  essence,  pattern  improvements  in 
the  Hybrid  Edge-Periodic  method  are  generally  attributed 
to  the  choice  of  optimal  total  number  of  edge  rings  and 
overlapping,  which  serve  as  important  simulation  criteria, 
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ABSTRACT 

A  new  microstrip  patch  antenna,  capable  of  dual  or 
triple  band  operation  is  proposed.  The  patch  is 
circular  in  shape  and  is  loaded  by  two  shorting  pins 
having  a  certain  angular  separation  of  2a.  The 
dominant  cavity  modes  of  the  patch  antenna  are 
studied  using  a  developed  rigorous  theory.  It  is  found 
that  the  first  three  dominant  modes  of  the  loaded 
patch  can  act  as  good  radiators  with  distinct  resonant 
frequencies,  hence  providing  dual  or  triple-band 
operation.  Salient  mode  characteristics  such  as  the 
resonant  frequency,  radiation  power,  and  the 
radiation  quality  factors  are  derived.  Numerical 
results  showing  the  dependence  of  these  parameters 
on  the  angular  separation  angle  2a  and  the  patch 
geometry  are  presented  and  compared  with  several 
simulations  obtained  by  Zealand  IE3D  software. 

L  INTRODUCTION 

With  the  growing  interest  in  wireless  communication 
and  the  emergence  of  new  generations  there  has  been 
a  need  to  use  the  same  antenna  for  more  than  one 
band.  Several  approaches  have  been  proposed  for 
single  feed  dual  frequency  operation.  These  include 
the  annular  slot  antenna  with  capacitor  loading  [1], 
the  compact  FIFA  [2]  and  the  H-  microstrip  antenna 
[3],  Here  we  propose  a  new  patch  antenna 
configuration  that  can  achieve  dual  or  multiband 
operation.  The  proposed  antenna  is  a  circular  patch 
loaded  with  two  shorting  pins  as  depicted  in  Fig.l. 
The  case  of  a  circular  patch  with  a  single  shorting  pin 
has  been  studied  by  several  authors  [e.g.4-6]  and  is 
proved  to  provide  a  significant  size  reduction  when 
operated  in  the  dominant  mode.  However  a  single  pin 
loaded  patch  suffers  from  the  necessity  to  keep  close 
proximity  between  the  pin  and  the  feeding  probe, 
which  raises  a  mechanical  difficulty.  The  use  of  a 
patch  with  two  shorting  pins  with  a  given  angular 


separation  should  alleviate  this  difficulty,  besides 
providing  multiband  operation  with  controlled  ratio 
of  resonant  frequencies. 

In  this  paper  we  consider  a  circular  patch  with  two 
shorting  pins.  The  aim  is  to  deduce  the  characteristics 
of  the  first  few  cavity  modes  in  terms  of  their 
resonant  frequencies  and  their  radiation  behavior.  To 
our  knowledge,  no  analysis  has  been  presented  for 
this  patch  configuration.  So,  we  introduce  here  a 
rigorous  analysis  to  determine  the  resonant 
frequencies,  field  distribution  and  radiation  character 
of  the  dominant  modes.  In  the  next  section  we 
briefly  derive  the  modal  equation  for  the  cavity 
modes  and  obtain  their  modal  fields.  In  section  3, 
modal  radiation  fields  and  power  are  addressed.  We 
compare  numerical  results  obtained  by  theory  with 
some  simulations  performed  on  the  Zealand  IE3D 
software  in  section  4. 

IL  CAVITY  MODES  OF  A  2-PIN-LOADED 
CIRCULAR  PATCH 

We  consider  a  circular  patch  of  a  radius  ‘a’  on  a 
grounded  dielectric  layer  of  thickness  ‘/i’  and  relative 
dielectric  constant  ‘e/  with  z  axis  coinciding  with  the 
patch  axis.  Two  thin  shorting  pins,  each  of  radius  ‘Z?’ 
are  placed  at  (r^,  ^=±a)  as  shown  in  Fig.l.  Adopting 
the  cavity  model  for  the  patch,  the  boundary  r=a  is 
considered  to  behave  as  a  magnetic  wall.  We  wish  to 
find  the  fields  and  the  resonant  frequencies  of  the 
normal  cavity  modes  in  the  presence  of  the  two 
shorting  pins.  We  start  by  assuming  z-oriented 
currents  /j  j  cxp(J(0^t)  flowing  in  the  two  pins, 

where  cq.  is  the  (so  far  unknown)  modal  resonant 
frequency.  For  even  modes  (having  even  function 
of  0 ),  we  have  1^=  I  •  Conversely,  for  odd 

modes,  /,  =  -/2  =  / .  Due  to  the  smallness  of  the  pins 
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radii,  the  currents  can  be  considered  to  be 
concentrated  on  the  axes  of  the  pins. 


Fig.l :  A  circular  patch,  of  radius  a  with  two  shorting 
pins  of  radii  b  and  angular  distance  2oc;  ro  is  the  radial 
position  of  the  pins. 

Following  the  authors’  work  in  [6],  such  modal  pin 
currents  produce  a  modal  electric  field  given  by: 


COSW0 

sin  ncp 


for  0  ^  r  <  rp  ,  and, 


('■.  ^)  =  (^  / 'o )  ^  "  O')  ±  5  ((/> + a)] 


=  (2I  lJtr,)5{r-r,)'^X. 

«=0 


COS  COS  na^ 
sin  rKp  sin  na  ^ 


(3) 


and  the  discontinuity  of  by  this  current  leads  to 
the  determination  of  the  coefficients  En  as: 


E„=-jcofiJXn 


cos  na 
sin  na 


JMYXka)-J:ika)YSK) 


JXka) 


(4) 


where  x„=^^or  n>  1 ,  and  ;^o  =  for  n^O. 

Now,  the  modal  equation  for  the  resonant  frequency 
is  obtained  by  imposing  the  boundary  condition  of 
vanishing  at  the  pin  surface.  Since  the  pin  radius 
is  assumed  very  small  relative  to  the  field 
variation,  we  are  allowed  to  satisfy  the  vanishing  E^ 
at  one  line  on  the  pin  surface;  say  the  line  {r=ro-b, 
(j)=a)  to  get,  after  some  manipulations: 


Y,(kb)±Y,(kd)-4^X„  [Wr-o -b)J„(kr,)] 

n=0 


costal 
sin^  aj 


YXka)/JU^a)^0 


(5) 


JXkr)Y:(ka)-Y„{kr)J:(ka) 
JXkr,)YXka)-Y„{krMka) 

(2) 

for  <  r  <  a , where  the  cos  and  sin  functions  relate 
to  even  and  odd  modes  about  (|)=0  respectively.  The 
functions  and  };(.)  are  the  Bessel  functions  of 

first  and  second  kind;  the  prime  denotes 
differentiation  with  respect  to  the  argument  and 
k^co^  Ic ,  with  c  the  wave  velocity  in  free  space. 

Note  that  Ej,  is  readily  continuous  at  while 

dEJdr  =  Q  at  r  =  a  satisfying  the  boundary 

condition  at  the  magnetic  wall.  Next,  we  use  the 
discontinuity  of  due  to  the  pin  currents  at  r=ro. 
The  latter  can  be  expressed  as: 


COS  n(p 
sin  n(p 


where  d  is  the  distance  between  the  two  pins  and  the 
upper/  lower  terms  correspond  to  even/odd  modes 
respectively.  The  resonant  frequencies  are  obtained 
by  solving  the  modal  equation  (5)  for  the  set  of 
discrete  values  of  ka  =  co^  a^j^  /  c  ,  with  0^  =  ccvi, 

(XV2...  which  are  the  modal  resonant  frequencies. 
Universal  curves  for  the  normalized  resonant 
frequency  ka  of  the  first  two  even  modes  and  the 
first  odd  mode  are  given  versus  half  pin  separation 
angle  ‘a’,  for  fixed  r^la  and  b/a  in  Fig.2.  These 
modes  are  the  dominant  modes  that  are  expected  to 
have  good  radiation  efficiency.  Here  ‘a'  should  be 
taken  as  the  effective  radius  of  the  patch  after 
accounting  for  field  fringing.  It  is  seen  that  dual  or 
triple  band  operation  is  possible  if  the  feed  excites 
efficiently  the  corresponding  modes.  It  is  useful  to 
note  here  that  the  placement  of  the  feeding  probe  in 
the  plane  ^=0  will  excite  the  even  modes  only, 
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leading  to  dual  band  operation.  Setting  the  feed  at 
0^  0  will  excite  the  odd  mode  as  well.  In  Fig  3,  we 

show  the  ratio  of  the  resonant  frequency  of  the  first 
two  even  modes  versus  the  angle  a.  It  is  seen  that 
dual  band  operation  with  frequency  ratio  ranging 

Modal  Resonant  Frequencies, 


0  10  20  30  40  50  60  70  80  90 

Angle  o 


Fig.2:  Normalized  resonant  frequency  ka  for  first  2 
even  modes  and  the  first  odd  mode  versus  angle  a. 
Here  Z?/ae=0.03  and 
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Fig.3  :  Ratio  of  resonant  frequencies  of  the  second  to 
first  even  modes  versus  angle  a. 


III.  RADIATION  CHARACTER 
OF  THE  MODES 

In  this  section  we  derive  the  radiation  fields  and 
power  of  each  of  the  natural  modes  of  the  patch.  The 
starting  point  is  to  find  the  aperture  field  at  r^a. 
Using  (2)  and  (4)  with  we  get: 


from  ~  1 .6  to  3  is  possible  by  the  right  choice  of  ‘a 
For  example  dual  band  operation  at  the  second 
generation  GSM  (2G)  centered  at  925  MHz  and  the 
third  generation  (3G)  IMT-2000  centered  at  2035 
MHz  is  possible  by  the  choice  a=56°.  In  this  case,  the 
effective  patch  radius  =35. 2mm  on  a  substrate  with 
er=2.2.  Note  that  the  actual  patch  radius  will  be  less 
because  of  field  fringing.  In  the  above  example,  the 
required  radius  a=3 1 ,6mm  when  the  substrate  height 
h=l.  04  mm.  Another  point  that  can  be  inferred  from 
Fig.2  is  that  the  first  even  mode,  which  is  the 
dominant  cavity  mode  of  the  patch,  possesses  the 
least  resonant  frequency.  This  means  that  for  a  given 
operating  frequency,  this  mode  provides  patch  area 
saving.  This  agrees  with  results  obtained  earlier  [6] 
for  the  dominant  mode  of  a  single  pin  loaded  patch. 


/i=0 


cos  710 
sin  n(t> 


-2(OrMoI 
K  ka 


Y  Jn^krp) 


COS  na  cos  n 
sin  wasinw 


0 

0 


(6) 


where  the  Ean  factors  are  defined  by  (6)  for  even  and 
odd  modes.  The  fields  excited  by  this  aperture  field 
in  the  outer  region  r  >  aand  z  >  h  have  been  derived 
in  [6].  In  the  far  zone  fields  are  derived  as: 


Eg(R,e,<l>)^-koah(e-^'^'^IR) 


sin0) 


cos  n(l> 
s  sin  n<l> 


(7) 
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E^(R,e,^)  =  koah(e'^''<‘'^ /R)  cosO 


J „(k(,a sin 6) f  sin n(t>  ^ 
^ocsind  y- cos  n<l> 


(8) 


where  (R,  6,  <p)  are  the  usual  spherical  coordinates, 
kQ  =  COyjJi^ , and  kQR  » \ .  The  total  radiated 
power  is  given  by ; 


Khi^ 


r 

\ 
g=0 


^(1  1 1 Xn)  I-,  where 
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A  ,  n^'jkp  ,2 


Jnigr) 

(9) 


dga 


Mode  Radiation  Quality  Factors 


:  isi 

_ _ 

even  mod 

2nd 

even  mode 

^  — 

SSSSi 

1st  0( 

Jd  Mode 

ro/ae=0.7 

b/ae=0.0: 

5 

0  20  40  60  80 

Angle  a 


Fig.4:  Radiation  Quality  factor  of  the  first  two  even 
modes  and  the  first  odd  mode. 


obtained  as  Q,=^(0,W/P^,  The  quality  factors  for 
the  first  two  even  modes  and  the  first  odd  mode  of  a 
2-pin  loaded  patch  are  plotted  versus  the  angle  a  in 
Fig.4.  It  is  clear  that  the  quality  factor  of  the  first 
even  mode  is  significantly  higher  than  those  of  the 
second  even  and  first  odd  mode.  This  means  that  the 
first  even  mode  is  a  less  efficient  radiator  than  the 
other  two  modes. 

IV.  SIMULATION  RESULTS 

In  order  to  support  the  presented  theory  several 
simulations  are  carried  out  using  Zealand  IE3D 
software.  The  geometry  considered  is  that  shown  in 
Fig.l  and  the  geometry  parameters  are  summarized 
for  four  simulation  cases  in  Table  1.  The  magnitude 
of  the  reflection  coefficient  Sn,  relative  to  a  50 
feed  line,  is  shown  in  dB  versus  frequency  in  Figures 
5-7.  The  angle  a  is  taken  equal  to  30^  in  Fig.5,  60^  in 
Fig.  6,  and  72^  in  Fig.7.  In  all  simulations,  e,  =2.2, 
r(/ae=0J5,  b/ae=0.03,  and  h/ae=0.2,  where  a,  is  the 
effective  patch  radius  taking  the  field  fringing  into 
account  [7]. 


simulated  ISU] ,  a  =30® 


The  integration  is  taken  over  the  radial  wavenumber 
g  in  the  range  {0-ko)  and  can  be  readily  evaluated 
numerically.  The  sum  can  be  truncated  at  n=3  or  4 
with  no  significant  error.  The  energy  stored  "W  in 
the  patch  cavity  for  a  given  mode  at  resonance  can  be 

obtained  by  integrating  in  (1-2)  over  the  cavity 
volume.  Now,  the  mode  radiation  quality  factor  Qr  is 


Fig.5:  |S1 1|  in  dB  for  a=30''  and  two  feed  positions 

We  have  chosen  the  actual  patch  radius  such  that  the 
first  mode  resonates  at  925  MHz.  With  a  =30^,  the 
feed  is  located  at  (ro,0y  =0)in  for  the  solid  curve 

where  the  first  two  even  modes  appear  at  -900  MHz 
and  2575  MHz.  As  expected,  no  odd  mode  is  excited 
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in  this  case  since  the  feed  is  located  at  a  null  for  these 
modes.  When  the  feed  is  moved  to  =40^)  for 

the  dashed  curve  in  Fig. 5,  the  first  odd  mode  is 
excited  at  a  resonant  frequency  fj  odd  =2062  MHz,  in 
addition  to  the  first  two  even  modes. 


0.5  1  1.5  2  2.5  3 

Frequency  (GHz) 


Fig.6:  |S1 1|  for  a=60®  and  feed  at  (ro,45°) 

Next  consider  the  simulations  for  a=60^  and  72^  in 
Fig.  6  and  7  respectively.  For  such  large  values  of  a, 
the  modal  currents  are  small  near  the  axis  (t)=0.  So,  if 
the  feed  were  located  at  (t)=0,  the  input  impedance 
would  be  too  high  to  achieve  matching.  We  therefore 
choose  (|)f=45®  and  50®  in  Fig.6  and  7.  In  Fig.6  for 
a=60®,  only  two  modes  appear  since  the  second  even 
mode  and  the  first  odd  mode  have  their  resonant 
frequencies  so  close  (see  Fig.2)  that  they  appear  as 
one  mode.  For  a=72^,  three  modes  appear  with  the 
odd  mode  having  a  resonant  frequency  higher  than 
that  of  the  second  even  mode  in  agreement  with 
theoretical  results  of  Fig.2. 

The  simulated  resonant  frequencies  are  compared 
with  the  theoretical  values  in  the  first  row  of  Table  2 
and  it  is  seen  that  the  difference  between  simulated 
and  theoretical  results  ranges  from  0.3%  and  3%. 
This  is  largely  caused  by  the  definition  of  the 
effective  patch  radius  based  on  electrostatic 
analysis  [7]  while  it  is  expected  that  it  must  change 
with  frequency  from  one  mode  to  another. 


0.5  1  1.5  2  2.5  3 

frequency  (GHz) 


Fig.7:  |S1 1|  for  a=72®  and  feed  at  (ro,50®) 

It  is  clear  that  for  the  chosen  feed  location  ((|)f=50^), 
the  odd  mode  is  not  well  matched.  There  was  no 
attempt  to  optimize  the  feed  location,  but  it  is 
probably  difficult  to  achieve  matching  for  the  three 
modes  with  a  single  feed.  For  example,  it  may  be 
necessary  to  use  a  matching  circuit  for  the  odd  mode 
in  Fig.7.  Matching  techniques  as  described  in  [8]  may 
then  be  applied. 

Considering  the  bandwidth s  of  the  modes  in  Fig.6, 
we  find  that  the  -10  dB  bandwidth  of  the  900  MHz 
mode  is  about  36  MHz  which  amounts  to  4%.  The 
bandwidth  of  the  second  mode  is  about  175  MHz 
around  a  center  frequency  of  1950  MHz  which  is 


Table  1 

Geometrical  parameters  of  Simulation  cases 


No. 

Angle 

a 

a 

(mm) 

^effective 

(mm) 

/z(mm) 

Feed  at 

fo<l)0 

1 

30 

27.95 

31.11 

6.19 

(23.3, o'") 

2 

30 

27.95 

31.11 

6.19 

(23.3,40'’) 

3 

60 

32.36 

36.04 

7.21 

(27.0,45“) 

4 

72 

33.77 

37,61 

7.52 

(28.2,50“) 

24 


ACES  JOURNAL,  VOL.  18,  NO.  4,  NOVEMBER  2003 


equivalent  to  9%.  These  bandwidths  are  adequate 
for  dual  band  operation  for  the  2G  GSM  and  IMT- 
2000  provided  that  the  second  center  frequency  is 
shifted  to  2035MHz  by  changing  a  to  56°  as  stated 
earlier. 


Table  2 

Theoretical  versus  Simulated  results 
on  modal  resonant  frequencies 


Angle 

a" 

r  TH/f  SIM 
Ijc  'he 

(MHz) 

r  TH/r  SlM 
he  'he 

(MHz) 

f  TH/f  SIM 
Ilo  /Ilo 

(MHz) 

30" 

925/900 

2629/2575 

2068/2062 

60“ 

925/912 

1887/1937 

2047/1937 

72° 

925/900 

1661/1712 

2088/2025 

V.  CONCLUSIONS 

The  cavity  modes  on  a  circular  microstrip  patch 
antenna  loaded  by  2  shorting  identical  pins  have  been 
rigorously  treated.  The  resonant  frequencies  of  the 
even  and  odd  modes  are  obtained  as  the  solution  of  a 
derived  modal  equation  and  it  is  found  that  the  lowest 
two  even  modes  and  the  first  odd  mode  provide  the 
possibility  of  dual  or  triple  band  operation.  The  pin 
separation  angle  2a  acts  a  controlling  parameter  for 
the  modal  resonant  frequencies  and  therefore 
determines  the  ratios  between  the  operating  bands  in 
dual  or  triple  band  operation.  Simulations  performed 
on  the  Zealand  IE3D  software  support  the  theoretical 
results.  As  expected,  the  position  of  the  feeding  probe 
determines  the  relative  excitation  of  modes  and  input 
impedance  at  their  resonant  frequencies.  Results 
show  that  it  is  feasible  to  have  a  dual  band  operation 
with  good  matching  using  a  single  feed.  It  may  be 
difficult  though  to  achieve  good  matching  for  the 
three  modes  in  a  triple  band  operation,  whence  it  may 
be  necessary  to  use  some  matching  technique  at  one 
of  the  modes.  The  results  are  pertinent  to  antenna 
design  for  wireless  communication. 
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Abstract:  Characteristics  and  radiation  patterns 
of  many  antenna  geometries  and  antenna  arrays 
can  be  evaluated  but  not  easily  visualized.  This 
paper  presents  a  software  package  that  has  been 
developed  to  allow  for  2D  and  3D  visualization 
of  the  radiation  patterns  for  many  different  types 
of  antennas  and  antenna  arrays.  The  package 
allows  the  user  to  visualize  the  field  patterns  for 
a  given  type  of  antenna,  and  to  display  the 
constituent  parameters  (input  impedance, 
directivity,  gain,  etc).  The  user  may  inspect  the 
field  pattern  for  a  single  element  of  many 
different  types  of  antennas  (such  as  dipole,  loop, 
aperture)  or  for  arrays  of  common  elements.  The 
parameters  for  these  antennas  or  arrays  may  be 
varied  manually  or  via  an  automated  swept 
parameter  menu.  The  program  allows  for  the 
design  and  study  of  diverse  antenna  arrays. 
Common  types  of  1-D,  2-D,  and  3-D  arrays  are 
available,  as  well  as  a  builder  for  an  arbitrary 
system  of  elements.  Synthesis  and  simulation 
tools  are  also  integrated  into  the  package  to 
allow  for  automatically  detennining  the  best 
configuration  for  an  array  or  an  element  to  meet 
a  predetermined  radiation  characteristic. 

1.  Introduction 

In  the  course  of  designing  an  antenna  element  or 
an  array  of  elements  it  often  becomes  useful  to 
have  a  method  of  visualizing  the  radiation 
pattern  and  for  determining  the  constituent 


parameters  of  the  antenna  system.  This  program 
allows  for  the  design  and  visualization  of  both 
single  elements  and  of  arrays  of  common 
elements.  The  visualization  option  in  the 
program  allows  for  the  inspection  of  the 
radiation  pattern  in  full  3-D  or  in  multiple  2-D 
and  3-D  plane  cuts.  A  great  advantage  is  gained 
by  being  able  to  quickly  and  efficiently  examine 
the  radiation  pattern  in  various  manners.  The 
ability  to  examine  the  field  structure  for  many 
common  types  of  antennas  and  antenna  arrays 
enhances  the  educational  and  research  value  of 
this  package.  In  addition  to  being  able  to 
examine  the  field  structure  of  the  antenna 
element  or  array,  the  package  also  allows  for 
certain  observational  calculations  to  be  displayed 
as  well. 

2.  Single  Element  Simulation 

Since  the  radiation  patterns  and  constituent 
parameters  for  many  common  types  of  elements 
are  well  known,  calculating  the  radiation 
patterns  is  performed  in  a  straightforward 
manner.  The  user  first  selects  one  of  the  element 
types  given  (dipole,  loop,  helix,  infinite 
biconical,  aperture,  or  comer  reflector).  This  will 
bring  up  the  appropriate  basic  pattern  and  initial 
parameters  for  this  type  of  element.  Many  of  the 
element  types  have  various  configurations  and 
sub-types  available  in  the  program.  For  example 
if  a  dipole  is  selected,  the  user  may  choose  the 
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type  of  dipole  (thin  wire,  thick  cylindrical,  small 
vertical  with  ground  plane,  or  small  horizontal 
with  ground  plane),  and  then  set  the  parameters 
for  the  dipole  (frequency/wavelength,  dipole 


length,  maximum  current,  and  far  field  distance). 
All  these  parameters  will  be  used  to  calculate  the 
field  pattern  to  be  displayed  and  its  calculated 
parameters  (gain,  directivity,  etc.),  in  separate 


I  Type'"' 

!|  I  Dipole 
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Figure  1.  The  program  windows  showing  a  dipole  pattern  for  L=1.5X. 


windows,  as  seen  in  Figure  1 . 

Once  the  antenna  type  has  been  selected  and  the 
parameters  for  the  antenna  have  been  entered, 
the  program  calculates  the  E  and  H  field  patterns 
from  this  information.  The  main  program  passes 
off  the  entered  parameters  into  the  appropriate 
module  to  perform  the  field  and  parameters 
calculations.  By  using  a  modular  system,  adding 
new  features  such  as  new  elements  or  arrays, 
becomes  a  simple  matter  of  loading  the 
appropriate  module.  The  program  then  calculates 
the  magnitude  and  phases  of  each  field 
component  at  a  user  selectable  number  of  points 
in  0  (elevation)  and  (p  (azimuth)  from  the 


gathered  information.  By  varying  the  number  of 
points  or  the  step  size  in  each  direction,  the  user 
may  increase  or  decrease  the  resolution  of  the 
pattern  being  generated.  This  saves 
computational  time  when  the  pattern  is  relatively 
smooth,  and  allows  for  fine  detail  when  the 
pattern  is  more  complex. 

In  the  main  window  along  with  the  antenna 
parameters,  are  the  visualization  parameters. 
These  allow  for  the  user  to  select  the  type  and 
way  the  pattern  is  viewed.  The  user  may  select 
firom  a  list  of  available  patterns  and  field 
components  (Total  E  Field,  Total  H  Field,  E- 
Theta,  E-Phi,  H-Theta,  H-Phi,  Radiation 
Intensity,  etc.),  which  allows  the  user  to  examine 
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both  the  total  field  characteristics  and  the 
individual  directional  characteristics  as  well.  The 
user  may  also  choose  which  format  they  wish  to 
see  the  pattern  displayed  in,  either  in  a  linear 
relative  format  or  in  a  normalized  dB  format. 
When  the  normalized  dB  format  is  selected,  the 
user  may  enter  the  Dynamic  Range  for  data  to  be 
shown.  Since  in  dB  format  the  data  can  range 
from  0  down  to  -oo,  the  range  is  a  useful  tool  to 
examine  either  the  major  features  of  the  field 
pattern  or  can  be  varied  down  to  show  even 
small  fluctuations.  One  of  the  most  useful  tools 
in  the  package  allows  for  the  combination  of 


plane  cuts  and  transparency  settings  for  the 
displayed  pattern.  When  the  full  pattern  is 
displayed,  the  transparency  slider  sets  the 
transparency  for  the  whole  figure  from 
completely  transparent  to  completely  opaque. 
When  one  of  the  3-D  plane  cuts  is  selected,  the 
slider  will  go  from  just  showing  a  small  slice  of 
the  pattern  as  seen  in  Figure  2,  to  showing  a 
slightly  transparent  half  of  the  pattern  at  the 
halfway  point,  and  up  to  a  completely  opaque 
figure  at  its  highest  setting.  Likewise  the 
program  has  the  ability  to  show  plain  2-D  cuts  as 
well  for  any  figure  type. 


Figure  2.  The  pattern  window  showing  various  3-D  and  2-D  plots. 
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3.  Swept  Parameters 

A  very  useful  tool  in  antenna  design  and 
simulation  allows  for  the  program  to  sweep  over 
a  set  range  of  parameters.  With  the  Swept 
Parameter  tool  the  user  may  select  any  one  of  the 
active  parameters  to  sweep  across.  The  user  then 
selects  the  starting  and  ending  points  for  a 
parameter  to  sweep  across  and  also  enters  the 
number  of  increments  to  be  used.  When  started 
the  program  will  begin  to  sweep  across  this 


parameter  and  display  the  results  either  as  an 
animated  figure  in  the  pattern  window  (for  the 
type  of  plot  being  displayed)  or  they  may  step 
through  the  sweep  one  point  at  a  time  to  examine 
the  results.  This  ability  not  only  aids  in  the  fine 
tuning  of  a  desired  element  by  showing  the 
changes  in  the  pattern  for  small  increments,  but 
also  adds  to  the  software  packages  educational 
value  as  well. 
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Figure  3.  Sweeping  a  dipole  from  L=lm  to  2m  (X=lm). 
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4.  Antenna  Arrays  phases  of  the  elements  (either  single  elements, 

rows  of  elements,  or  planes  of  elements 
One  of  the  main  features  of  the  program  allows  depending  on  the  array  type)  and  the 

for  the  design  of  any  type  of  array  of  common  corresponding  array  factor  is  generated.  The 

elements.  From  the  main  window  the  user  may  array  factor  itself  may  be  viewed  or  a  composite 

select  any  one  of  the  1-D,  2-D,  or  3-D,  array  pattern  of  the  array  made  with  a  selected  element 

types.  These  include  Linear,  Circular,  Planar,  can  be  viewed.  When  an  array  is  chosen  an 

Cubical,  Spherical,  or  the  completely  arbitrary  element  pattern  window  will  open  allowing  the 

array.  In  all  but  the  arbitrary  array  the  user  user  to  choose  which  antenna  will  be  used  for 

simply  selects  the  desired  array,  enters  the  the  individual  elements, 

number  of  elements,  the  relative  amplitudes  and 


Figure  4.  Radiation  patterns  for  arrays  of  isotropic  elements  for,  (a)  Fourier  synthesized  1-D  array, 
(b)  circular  2-D  array,  (c)  planar  array,  and  (d)  spherical  array. 
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^  Array  Element  Control 


Figure  5.  The  array  element  window  and  arraybuilder  window. 


A  few  common  types  of  antenna  elements  are 
available  here  as  a  matter  of  convenience,  as 
well  as  an  option  for  the  user  to  load  a  pattern 
file.  This  file  may  be  generated  by  this  program 
earlier  while  designing  a  single  element  or  may 
contain  data  generated  elsewhere.  This  allows 
for  the  use  of  the  patterns  generated  from  other 
programs  or  gathered  from  real  antermas  on  an 
antenna  range  to  be  used  as  elements  when 
analyzing  arrays. 

Alternatively  the  program  allows  for  the  analysis 
and  visualization  of  completely  arbitrary  arrays 
of  elements.  In  order  to  accomplish  this,  the 
program  includes  what  is  known  as  the 
arraybuilder.  This  is  a  small  subprogram  that 
allows  for  the  layout  and  viewing  of  the  array 
elements.  The  user  may  choose  the  location  in  x, 
y,  and  z  directions  (all  points  are  relative  to  each 
other  and  the  distance  is  measured  in 
wavelengths)  and  the  relative  amplitude  and 
phase  of  each  element.  The  arraybuilder  allows 
for  the  interactive  placement  and  updating  of 
these  elements,  as  changes  are  shown 
immediately  in  the  arraybuilder  window  shown 
in  Figure  3.  Elements  may  be  added,  updated  or 
deleted  at  will,  and  as  can  be  seen  in  the  figure. 


the  currently  selected  element  is  highlighted  in 
blue. 

5.  Conclusions 

The  program  successfully  allows  the  user  to 
interactively  design  and  visualize  many  common 
types  of  antennas  as  well  as  arrays  of  elements. 
It  has  great  utility  not  only  in  its  use  as  a  design 
program  for  antennas,  but  as  a  learning  tool  as 
well.  It  can  allow  the  user  to  interactively 
explore  antenna  patterns  and  its  properties  and 
promotes  greater  imderstanding  of  antenna 
design.  The  element  design  features  provide  a 
good  platform  for  design  and  visualization.  The 
array  features  allow  for  the  interactive  design 
and  visualization  of  many  different  types  of 
arrays  and  allow  for  the  testing  and  verification 
of  array  designs. 
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Abstract 

A  full-wave  electromagnetic  model  is  developed  and 
verified  for  a  waveguide  transition  consisting  of  slotted 
rectangular  waveguides  coupled  to  a  strip  line.  This 
waveguide-based  structure  represents  a  portion  of  the 
planar  spatial  power  combining  amplifier  array.  The 
electromagnetic  simulator  is  developed  to  analyze  the 
stripline-to-slot  transitions  operating  in  a  waveguide- 
based  environment  in  the  X-band.  The  simulator  is 
based  on  the  method  of  moments  (MoM)  discretiza¬ 
tion  of  the  coupled  system  of  integral  equations  with 
the  piecewise  sinusodial  testing  and  basis  functions  in 
the  electric  and  magnetic  surface  current  density  ex¬ 
pansions.  Electric  and  magnetic  dyadic  Green’s  func¬ 
tions  used  in  this  integral  equation  formulation  are  de¬ 
veloped  for  an  infinite  rectangular  waveguide  in  the 
form  of  partial  expansion  over  the  complete  system 
of  eigenfunctions  of  a  transverse  Laplacian  operator. 
Numerical  results  are  obtained  and  compared  with  a 
commercial  microwave  simulator  for  a  few  represen¬ 
tative  structures,  including  various  configurations  and 
planar  arrays  of  slotted  waveguide  modules  coupled  to 
a  strip  line. 

I.  Introduction 

Military  and  civilian  applications  require  sizable 
power  at  microwave  and  millimeter- wave  frequen¬ 
cies.  Medium  to  high  power  levels  are  needed  for 
applications  such  as  communications,  active  missile 
seekers,  radar,  and  millimeter-wave  imaging.  To  meet 
this  need,  klystrons,  traveling-wave  tubes,  and  gridded 
tubes  are  heavily  utilized  to  generate  medium  to  high 


power.  However,  tubes  are  bulky,  costly,  require  high 
operating  voltages,  and  have  a  short  lifetime.  As  an 
alternative,  solid-state  devices  offer  several  advantages 
such  as,  lightweight,  smaller  size,  wider  bandwidths, 
and  lower  operating  voltages.  These  advantages  lead 
to  lower  cost  because  systems  can  be  constructed  us¬ 
ing  planar  fabrication  techniques.  However,  as  the  fre¬ 
quency  increases,  the  output  power  of  solid-state  de¬ 
vices  decreases  due  to  their  small  physical  size.  There¬ 
fore,  to  achieve  sizable  power  levels  that  compete  with 
those  generated  by  vacuum  tubes,  several  solid-state 
devices  can  be  combined  in  an  array.  Conventional 
power  combiners  are  effectively  limited  in  the  num¬ 
ber  of  devices  that  can  be  combined.  To  overcome 
these  limitations  and  produce  high  power  levels  at  mi¬ 
crowave  and  millimeter-wave  frequencies,  in  a  past  few 
years  there  has  been  a  considerable  activity  in  develop¬ 
ing  spatial  power  combining  systems  [1],  [2],  [3].  The 
output  power  of  individual  solid-state  devices  in  a  pla¬ 
nar  array  is  combined  to  produce  mo  derat  e-to-high 
power  levels.  It  is  desirable  to  utilize  a  single  solid- 
state  amplifier,  however,  as  frequency  increases,  the 
output  power  levels  become  low  due  to  the  1//^  fall- 
off  of  available  power  [4].  By  utilizing  power  combin¬ 
ing  techniques  light-weight,  reliable,  and  low  cost  am¬ 
plifiers  and  oscillators  can  be  potentially  designed  to 
meet  the  demand  of  military  and  civilian  applications. 

Fundamental  understanding  of  spatial  power  com¬ 
bining  systems  has  primarily  been  done  by  experimen¬ 
tal  investigation.  Several  experimental  free  space  and 
dielectric  quasi-optical  power  combiners  and  waveg¬ 
uide  spatial  power  combiners  have  been  successful  at 
demonstrating  the  fundamental  concept  of  generating 
usable  output  power  levels  using  spatial  and  quasi- 
optical  techniques.  Although  great  strides  have  been 
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made,  to  date,  quasi-optical/spatial  power  combin¬ 
ing  systems  have  not  yet  out  performed  conventional 
power  combiners.  In  order  to  capture  the  full  potential 
of  quasi-optical/spatial  systems  to  generate  high  power 
levels,  numerical  modeling  and  computer  aid  engineer¬ 
ing  tools  are  needed  to  fully  understand  these  systems 
[5].  The  development  of  computer  models  helps  to  re¬ 
duce  the  cost  and  time  associated  with  experimental 
work,  and  computer  models  assist  with  designing  ef¬ 
ficient  quasi-optical/spatial  power  combining  systems. 
Modeling  a  quasi-optical/spatial  power  combining  sys¬ 
tem  is  complex  and  challenging.  There  are  several 
major  system  components  that  must  be  modeled  such 
as,  the  input  and  output  sources,  which  are  typically 
waveguide  horns  with  optical  lenses  inside,  the  input 
and  output  antennas  with  associated  transmission  lines 
and  control  components,  and  the  active  integrated  am¬ 
plifier  circuitry. 


between  the  middle  and  upper  waveguides.  The  strips 
are  located  inside  of  the  middle  waveguide  region.  The 
objective  of  stripline-to-waveguide  transition  is  to  effi¬ 
ciently  couple  energy  from  the  lower  waveguide  to  the 
upper  waveguide.  An  incident  electromagnetic  field  is 
illuminated  at  the  input  port  of  the  lower  waveguide. 
This  signal  travels  into  the  lower  waveguide  and  in¬ 
duces  magnetic  currents  on  the  lower  slots  where  the 
slots  scatter  energy  into  the  lower,  middle,  and  upper 
waveguides.  In  the  middle  waveguide  region  the  scat¬ 
tered  fields  induce  electric  currents  and  standing  waves 
along  the  strips.  The  scattered  energy  from  the  strips 
along  with  the  scattered  energy  from  the  lower  slots 
induce  magnetic  currents  in  the  upper  slots.  The  mag¬ 
netic  currents  in  the  upper  slots  cause  scattered  fields 
back  into  the  middle  waveguide  and  into  the  upper 
waveguide  region.  Optimum  performance  is  achieved 
by  varying  the  distance  between  the  slots,  adjusting 
the  slot  dimensions,  rotating  the  slots,  or  varying  the 
stripline  dimensions. 


PLANAR  WAVEGUIDE  SPATIAL  POWER  COMBINING 
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Fig.  1.  Aperture-coupled  stripline-to-waveguide  transition. 

In  this  paper,  an  electromagnetic  modeling  environ¬ 
ment  is  developed  for  an  aperture-coupled  stripline- 
to-waveguide  transition  (with  geometry  shown  in  Fig. 
1).  This  transition  is  the  fundamental  building  block 
for  two-dimensional  spatial  power  combining  amplifier 
arrays  shown  in  Fig.  2  and,  in  turn,  for  the  planar 
quasi-optical  power  combining  systems.  The  transition 
consists  of  three  infinite  aperture-coupled  rectangular 
waveguides.  The  lower  slots  (apertures)  are  located 
on  the  surface  between  the  lower  and  middle  waveg¬ 
uides,  and  the  upper  slots  are  located  on  the  surface 


Fig.  2.  Aperture-coupled  planar  waveguide  amplifier  array. 


A  full-wave  electromagnetic  model  is  developed  for 
a  structure  that  couples  a  waveguide  to  a  stripline 
through  a  set  of  slots  and  from  the  stripline  through 
another  set  of  slots  into  a  second  waveguide.  The  sys¬ 
tem  modeling  is  based  on  an  integral  equation  formu¬ 
lation  for  the  induced  electric  and  magnetic  surface 
current  densities  resulting  in  a  coupled  set  of  inte¬ 
gral  equations  discretized  via  the  method  of  moments 
(MoM).  The  scattered  electric  and  magnetic  fields  are 
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expressed  in  terms  of  dyadic  Green’s  functions  and  the 
electric  and  magnetic  surface  currents.  Electric  and 
magnetic  dyadic  Green’s  functions  are  developed  for 
an  infinite  rectangular  waveguide  in  the  form  of  par¬ 
tial  expansion  over  the  complete  system  of  eigenfunc¬ 
tions  of  a  transverse  Laplacian  operator.  The  surface 
currents  are  discretized  by  overlapping  piecewise  sinu- 
sodial  subdomain  basis  functions  in  order  to  accurately 
model  narrow  longitudinal  strips  and  transverse  slots. 
In  this  formulation,  a  MoM  matrix  includes  all  possible 
self  and  mutual  coupling  effects  between  the  slots  and 
strips.  The  transition  is  excited  with  the  TEio  dom¬ 
inant  waveguide  mode,  and  the  scattering  parameters 
are  calculated  from  the  forward  and  backward  coupling 
coefficients  in  the  waveguide  regions. 

Numerical  results  of  the  scattering  characteristics 
are  obtained  and  compared  with  a  commercial  mi¬ 
crowave  simulator  for  a  few  representative  structures, 
including  a  single  slot-strip-slot  waveguide  transition, 
multiple  slot-strip-slot  waveguide  transitions,  and  pla¬ 
nar  arrays  of  slotted  waveguide  modules  coupled  to 
strip  lines. 


Fig.  3.  Geometry  of  a  closed-boundary  waveguiding  structure 
containing  apertures  and  conducting  strips  in  the  presence 
of  an  impressed  electric  current  source. 


-jujU  /  Gej  {r,f')  ■  Ji„d{f')dS' 

JSe 

-[  GEM{f,f')-M{r')dS’  (1) 


II.  Theory 

A  general  electromagnetic  formulation  for  a  closed¬ 
boundary  waveguiding  structure  containing  arbitrarily 
shaped  apertures  and  conducting  strips  (see  Fig.  3) 
is  presented  in  this  section.  This  structure  is  a  gen¬ 
eral  building  block  of  the  aperture- coupled  stripline- 
to- waveguide  transition  shown  in  Fig.  1.  The  formula¬ 
tion  is  based  on  the  integral  representation  of  incident 
and  scattered  electric  and  magnetic  fields  in  terms  of 
dyadic  Green’s  functions  [6],  [7].  Dyadic  Green’s  func¬ 
tions  represent  the  electric  and  magnetic  fields  at  an 
observation  point  inside  a  volume  due  to  an  arbitrar¬ 
ily  oriented  point  source.  Fig.  3  shows  an  arbitrary 
volume  V  enclosed  by  the  surface  S  —  S  U  Smi  where 
S  represents  an  electric-type  boundary  surface  and  Sm 
represents  the  surface  of  apertures  (magnetic- type  sur¬ 
face).  The  volume  V  encloses  an  impressed  electric  vol¬ 
ume  current  source  Jimp  C  Vimp  and  an  electric  current 
source  Jind  induced  on  the  surface  of  conducting  strips 
56(010  ctric-type  surface). 

The  integral  representations  for  the  total  electric  and 
magnetic  fields  in  volume  V  due  to  the  impressed  and 
induced  currents  are  obtained  as  follows 

E{f)  =  -jujid  f  Gej  (r.f')  •  Jimp{r')dV' 

t<mp 


H{f)  =  /  Ghj  (r,  f)  ■  Jimp{f')dV' 

+  /  Ghj  (r,f')  ■  Jind(r')dS' 

JS, 

-M  f  Ghm  {r,r')  ■  M{r')dS'  (2) 

where 

GHj{r,f')  =  VxGEj{f,r'),  (3) 

Gem  {r,f')  =  VxGHM{r,f').  (4) 

Here,  the  electric-electric  dyadic  Green’s  function,  Gej 
(f,  r'),  relates  the  electric  field  in  volume  V  enclosed 
by  surface  S  to  the  impressed  electric  current  source 
Jimpi'^)  ^  Vimp  and  the  induced  electric  surface  cur¬ 
rent  Jind{^)  €  the  electric-magnetic  dyadic  Green’s 

function.  Gem  relates  the  electric  field  in  the 

volume  V  to  the  equivalent  magnetic  surface  current 
M{f)  €  Sm\  the  magnetic-magnetic  dyadic  Green’s 

function,  Ghm  relates  the  magnetic  field  in  the 

volume  V  to  the  equivalent  magnetic  surface  current 
M(r )  6  5m,  and  the  magnetic-electric  dyadic  Green’s 

function,  Ghj  relates  the  magnetic  field  in 

the  volume  V  to  the  impressed  electric  current  source 
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Jimpir)  G  Vimp  and  the  induced  electric  surface  cur¬ 
rent  Jind{T)  c  5e.  These  electric  and  magnetic  dyadic 
Green’s  functions  are  developed  for  an  infinite  rectan¬ 
gular  waveguide  in  the  form  of  partial  expansion  over 
the  complete  system  of  eigenfunctions  of  a  transverse 
Laplacian  operator  [8]. 


r  ‘ 

\  j' 

\  Lt///  G/// 

J 

S', 

w, 

y„  \  G  n 

XiT; 

eL,hL  _S_ 

UJ  e;,h; 

Fig.  4.  Field  analysis  in  terms  of  incident  and  scattered  electric 
and  magnetic  fields  due  to  induced  electric  and  magnetic  sur¬ 
face  currents  in  the  aperture-coupled  stripline-to-waveguide 
transition. 

The  integral  equation  formulation  discussed  above 
was  applied  for  the  analysis  of  a  structure  that  couples 
the  lower  waveguide  (region  Vj)  to  the  striplinc  (re¬ 
gion  Vii)  through  a  set  of  slots  and  from  the  stripline 
through  another  set  of  slots  into  the  upper  waveguide 
(region  Vm)  (Fig.  4).  The  geometry  shown  in  Fig.  4  is 
a  unit  cell  of  a  general  topology  of  the  aperture-coupled 
stripline-to-waveguide  transition  (Fig.  1).  Fig.  4  shows 
a  field  analysis  in  terms  of  incident  and  scattered  elec¬ 
tric  and  magnetic  fields  in  three  different  regions  due 
to  induced  electric  and  magnetic  surface  currents.  A 
coupled  system  of  equations  is  obtained  by  imposing  a 
continuity  of  tangential  magnetic  fields  across  the  sur¬ 
faces  of  lower  and  upper  slots,  and  ^2,  respectively, 
and  an  electric-field  boundary  condition  on  the  surface 
of  the  strip,  ^3, 

y  X  Hlncir)  =  y  X  [H[\f)  -  Hl{r)  +  Hi\r) 

+Hl\f)],  f  eSi  (5) 

0  =  y  X  [Hl'ir)  +  Hi' in  “  Sfl^'in 

+Hl^{f)],  r  €  S2  (6) 

0  =  y  X  [El^{f)  +  El\f)  +  El\f)\,  f  €  53  (7) 


or  it  can  be  written  in  the  integral  form  in  terms  of 
corresponding  dyadic  Green’s  functions  and  impressed 
and  induced  surface  current  densities 

y  X  Hlncif)  = 

y  X  \jueji  f  Ghm  (^>^')  •  Mi{f')dS' 

JSi 

-ju)ei  f  Ghm  '  Mi[f')dS' 

JSi 

-jueir  [  Ghm  {r,f')-M2{f')dS' 

JS2 

+  /  G//J  (r,r")  •  J(r')c?5'],  re  Si  (8) 

JSs 

0  =  y  X  [jioeir  f  Ghm  (^> ^0  ‘  Mi{f')dS' 

Js, 

f  Ghm  (^>^0  •  M2{f')dS' 

JS2 

r  =iii  ,  .  ,  , 

+ji^eiir  /  Ghm  '  ^2{r  )dS 
JS2 

+  /  Ghj  ‘  ‘/(^V‘5'']:  r  G  52  (9) 

JSz 


^  =  Gem 

Jsi 

+  f  Gem  (^1^')  •  M2{f')dS' 

JS2 

[  Gej  (ry)  •  J{r')dSl  re  Ss.  (10) 

JS3 

Here,  5'4^(r )  is  the  incident  magnetic  field  gener¬ 
ated  in  the  lower  waveguide;  and  M2(r^)  are 

the  equivalent  magnetic  surface  currents  induced  on 
the  surfaces  of  the  lower  and  upper  slots,  Si  and  S2, 
respectively;  J{f')  is  the  electric  surface  current  in¬ 
duced  on  the  surface  of  the  strip,  5$;  Ghm  (F,r^), 

Ghj  Gem  (F,r '),  and  Gej  (r,f')  are  the  elec¬ 

tric  and  magnetic  Green’s  dyadics  of  the  corresponding 
waveguides  (regions  K/,  V//,  and  V///).  The  surface 
currents  are  discretized  by  overlapping  piecewise  sinu- 
sodial  subdomain  basis  functions.  In  this  formulation, 
a  MoM  matrix  includes  all  possible  self  and  mutual 
coupling  effects  between  the  slots  and  strips.  The  tran¬ 
sition  is  excited  with  the  TEio  dominant  waveguide 
mode  and  the  scattering  parameters  are  calculated 
from  the  forward  and  backward  coupling  coefficients 
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in  the  waveguide  regions.  The  details  of  the  method  of 
moments  discretization  technique  of  the  slotted  waveg- 
uide  transitions  and  dyadic  Green’s  functions  applied 
in  this  formulation  can  be  found  in  [8]. 

III.  Numerical  Results  and 
Discussions 

Numerical  results  of  the  scattering  characteristics 
were  obtained  and  compared  with  a  commercial  mi¬ 
crowave  simulator  for  a  few  representative  structures 
shown  in  Fig.  5,  including  a  single  slot-strip-slot 
waveguide  and  multiple  slot-strip-slot  waveguide  tran¬ 
sitions.  Also,  planar  arrays  of  slotted  waveguide  mod¬ 
ules  coupled  to  a  strip  line  (with  geometries  shown  in 
Figs.  8  and  10)  are  investigated. 


Fig.  5.  Top  view:  (a)  one  lower  slot,  one  strip,  and  one  upper 
slot;  (b)  two  lower  slots,  one  strip,  and  two  upper  slots;  (c) 
same  as  (b)  but  one  lower  slot  and  one  upper  slot  are  offset, 

(d)  three  lower  slots,  one  strip,  and  three  upper  slots;  and 

(e)  one  lower  slot,  two  strips,  and  one  upper  slot. 

Here  we  present  numerical  results  of  the  scattering 
characteristics  for  the  examples  of  the  double  slot- 
strip-slot  waveguide  transition  with  two  shifted  slots 
(case  (c)  in  Fig.  5)  and  two  strips  coupled  to  two  slots 
(case  (e)  in  Fig.  5).  In  both  examples,  the  upper  and 
lower  X-band  waveguide  dimensions  are  22.86  mm  x 
10.16  mm,  ej  —  £///  =  1.0,  while  the  middle  waveguide 
dimensions  are  22.86  mm  x  1.5748  mm  (62  mils),  and 
Eij  —  1.0.  In  the  case  shown  in  Fig.  5(c),  the  spacing 
between  the  lower  and  upper  slots  is  19  mm,  while  the 


8.0  8.5  9.0  9.5  10.0  10.5  11.0  11.5  12.0 

Frequency  (dB) 


(a) 


8.0  8.5  9.0  9.5  10.0  10.5  11.0  11.5  12.0 

Frequency  (GHz) 


(b) 


Fig.  6.  MoM  (solid  line)  and  HFSS  (dashed  line)  comparison 
for  the  scattering  parameters  (reflection  and  coupling  coef¬ 
ficients)  for  the  double  slot-strip-slot  waveguide  transition 
with  two  offset  slots.  Magnitude  and  phase:  (a)  5ii  and  (b) 
541 . 

inter-spacing  between  the  two  lower  slots  and  two  up¬ 
per  slots  is  10  mm.  The  length  of  the  strip  is  30  mm, 
the  width  of  the  strip  is  1  mm,  the  length  of  the  slots 
is  13  mm,  and  the  width  of  the  slots  is  1  mm.  The  re¬ 
flection  coefficient  5ii  and  the  coupling  coefficient  ^41 
computed  using  the  MoM  technique  presented  here  and 
the  HFSS  commercial  program  are  compared  in  Fig. 
6.  The  coupling  of  -8.6  dB  occurs  at  approximately 
9.2  GHz. 

In  the  second  example  with  geometry  shown  in  Fig. 
5(e),  the  longitudinal  strip  is  divided  into  two  strips 
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Fig.  7.  MoM  (solid  line)  and  HFSS  (dashed  line)  comparison  of 
the  scattering  parameters  for  one  lower  slot,  two  strips,  and 
one  upper  slot  waveguide  transition.  Magnitude  and  phase: 
(a)  5ii  and  (b)  S41. 


each  10  mm  in  length.  The  length  of  the  lower  and 
upper  slots  is  changed  to  15  mm.  In  the  method  of 
moments  program,  the  slots  and  strips  are  discretized 
in  1  mm  cells.  Both  scattering  parameters  peak  at  10 
GHz  (Fig.  7),  and  both  magnitudes  of  Su  and  ^41 
reach  a  peak  value  of  approximately  -5.5  dB  and  -7.2 
dB,  respectively. 

Fig.  8  shows  the  geometry  of  the  1x2  waveguide 
coupler  array  which  consists  of  2  transitions  in  series 
shown  in  Fig.  5(a).  Transitions  are  separated  by  40 


Fig.  8.  Top  view  of  the  slotted  1x2  waveguide  array  with  a 
unit  cell  shown  in  Fig.  5(a)  consisting  of  one  lower  slot,  one 
strip,  and  one  upper  slot. 


Fig.  9.  MoM  (solid  line)  and  HFSS  (dashed  line)  comparison  of 
Sii  for  the  1x2  one  lower  slot,  one  strip,  and  one  upper 
slot  waveguide  coupler  array,  (a)  Magnitude  and  (b)  phase. 
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mm  with  respect  to  the  center,  the  waveguide  length 
is  90  mm,  and  ej  =  en  —  ejjj  =  1.0. 

Fig.  9  compares  the  MoM  simulations  and  HFSS 
results  for  the  magnitude  and  phase  of  the  reflection 
coefficient  5ii,  The  maximum  and  minimum  values, 
-8.3  dB  and  -39.5  dB,  of  the  reflection  coefficient  occur 
at  8.7  GHz  and  11.2  GHz,  respectively. 

Fig.  10  shows  the  2x2  slotted  waveguide  array 
which  consists  of  four  transitions  with  the  geometry  of 
a  unit  cell  shown  in  Fig.  5(a).  The  2x2  array  repre¬ 
sents  two  waveguide  couplers  separated  by  a  distance 
of  23  mm.  The  length  and  width  of  the  slots  are  13  mm 
X  1  mm,  and  the  length  and  width  of  the  strip  are  30 
mm  X  1  mm,  respectively.  Both  the  lower  and  upper 
waveguide  width  and  height  dimensions  are  46.0  mm  x 
10.16  mm  and  ej  =  ejjj  —  1.0.  The  middle  waveguide 
dimensions  are  46.0  mm  x  1.5  mm  and  eu  =  2.2. 


Fig.  10.  Top  view  of  a  2  x  2  one  lower  slot,  one  strip,  and  one 
upper  slot  waveguide  array. 


Fig.  11  shows  the  MoM  simulations  for  the  magni¬ 
tude  and  phase  of  5ii  for  the  2x2  waveguide  array. 
A  total  of  m  =  n  =  125  modes  were  utilized  to  simu¬ 
late  the  array,  and  the  cell  size  of  slots  and  strips  were 
discretized  into  1  mm  increments.  The  minimum  value 
of  -32.4  dB  of  5ii  occurs  at  approximately  9.75  GHz. 


IV.  Conclusion 


In  this  paper  we  presented  the  analysis  of  aperture- 
coupled  stripline-to-waveguide  transitions  used  in  pla¬ 
nar  spatial  power  combining  systems.  The  method  of 
analysis  is  based  on  the  integral  equation  formulation 
for  the  unknown  electric  and  magnetic  surface  currents 
with  electric  and  magnetic  dyadic  Green’s  functions  of 
infinite  rectangular  waveguide.  The  method  of  mo¬ 
ments  discretization  with  piecewise  sinusoidal  testing 


0. 

0) 


CL 
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Fig.  11.  MoM  simulation  of  5ii  for  a  2  x  2  one  lower  slot,  one 
strip,  and  one  upper  slot  waveguide  array,  (a)  Magnitude 
(solid  line)  and  (b)  phase  (dashed  line). 


and  basis  functions  reduces  a  coupled  set  of  integral 
equations  to  a  matrix  equation.  Numerical  results  ob¬ 
tained  for  the  scattering  parameters  of  various  slot- 
strip-slot  waveguide  transitions  and  arrays  compare 
well  with  the  results  calculated  by  the  Finite  Element 
Method  commercial  program  (HFSS). 
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ACCURACY  OF  THREE  UNCONDITIONALLY-STABLE  FDTD  SCHEMES 
FOR  SOLVING  MAXWELL’S  EQUATIONS 


Guilin  Sun  and  Christopher  W.  Trueman 

Electromagnetic  Compatibility  Laboratory,  Concordia  University, 
7141  Sherbrooke  Street  West,  Montreal,  Quebec,  Canada  H4B  1R6 


Abstract  -  This  paper  discusses  accuracy  limitations  due 
to  numerical  dispersion  and  time  step  size  for  three 
implicit  unconditionally-stable  FDTD  methods: 
Alternate-Direction-Implicit  (ADI),  Crank-Nicolson 
(CN)  and  Crank-Nicolson-Douglas-Gunn  (CNDG).  It  is 
shown  that  for  a  uniform  mesh,  the  three  methods  have 
the  same  numerical  phase  velocity  along  the  axes,  but 
have  large  differences  along  the  diagonals.  The  ADI 
method  has  two  orders-of-magnitude  larger  anisotropy 
than  that  of  CN  and  CNDG.  CNDG  has  no  anisotropy 
at  certain  Courant  numbers  and  mesh  densities.  At  the 
limit  of  zero  spatial  mesh  size,  the  three  methods  have 
different  “intrinsic  temporal  dispersion”  for  a  given 
time  step  size:  CN  has  no  anisotropy;  ADI  has  positive 
anisotropy  and  CNDG  has  negative  anisotropy,  which  is 
much  smaller  than  ADI.  The  Nyquist  sampling 
theorem  provides  a  fundamental  upper  bound  on  the 
time  step  size  for  all  three  methods.  It  is  shown  that  for 
ADI  and  CN  the  practical  upper  bound  is  close  to  the 
Nyquist  limit,  but  for  CNDG  is  half  the  Nyquist  limit. 

Keywords  -  Maxwell’s  Equations,  Finite-difference  time- 
domain  method,  Alternate-Direction-Implicit  method, 
Crank-Nicolson  method,  Douglas-Gunn  method, 
numerical  dispersion,  numerical  anisotropy,  accuracy, 
Nyquist  criterion. 


I.  Introduction 

Yee’s  Finite-Difference  Time-Domain  (FDTD) 
method  is  popular  for  solving  Maxwell’s  Equations  [1].  It 
is  second-order  accurate  in  both  time  and  space  [2].  Since  it 
is  an  explicit  method  [3],  it  is  easy  to  program  and  efficient 
to  run.  However  it  suffers  from  the  Courant-Friedrich-Levy 
(CFL)  limit  or  the  Courant  limit  on  the  time  step  size 
required  for  stability.  For  objects  with  fine  geometrical 
features,  using  a  fine  mesh  size  greatly  reduces  the 
allowable  time  step  size,  which  causes  the  CPU  time  to  be 
prohibitively  long.  To  eliminate  the  CFL  limit, 
unconditionally-stable  methods  working  with  large  Courant 
numbers  are  desirable.  Early  in  1984,  Holland  [4]  proposed 
an  implicit  method  but  it  was  not  completely  stable.  In 
1995,  Shang  [5]  developed  an  efficient  characteristic-based 
algorithm  and  Fijany  [6]  proposed  a  parallel  Crank- 
Nicolson  (CN)  method  by  decomposition  of  the 
eigenvalue/eigenvector  for  the  wave  equations  of  the 
second  order.  In  1999  and  2000,  Namiki  [7]  and  Zhen  et  al. 


[8]  suggested  an  Altemate-Direction-Implicit  (ADI) 
method.  In  2001,  Beggs  and  Briley  [9]  reported  a  two- 
factor  scheme  by  combining  a  characteristic-based 
approach  to  spatial  differencing  with  an  implicit  lower- 
upper  approximate  factorization  that  avoids  the  solution  of 
a  tridiagonal  system.  Very  recently.  Sun  and  Trueman  [10] 
proposed  a  Crank-Nicolson  scheme  with  Douglas-Gunn 
algorithm  (CNDG). 

This  paper  compares  the  numerical  dispersion  and 
anisotropy  of  the  ADI,  CN  and  CNDG  methods  used  in 
FDTD,  and  analyzes  some  of  their  common  characteristics 
and  differences.  For  simplicity,  Yee’s  mesh  [1]  in  2D  is 
used  with  a  TE2  wave  in  a  linear,  isotropic,  non-dispersive 
and  lossless  medium.  This  paper  is  organized  as  follows. 
In  Section  II,  the  amplification  factors  are  listed  for  the 
three  methods.  In  Section  III,  their  numerical  dispersion 
relations  are  given  and  compared  for  a  given  mesh  density. 
In  Section  IV  their  numerical  anisotropy  is  analyzed  and 
compared  at  different  Courant  numbers.  In  Section  V  the 
time  step  size  limits  are  discussed  and  related  to  the 
Nyquist  criterion.  In  Section  VI  the  accuracy  limit  due  to 
dispersion  is  analyzed  for  zero  spatial  mesh  size. 

II.  Amplification  Factors 

The  update  equations  for  the  ADI-FDTD,  CN- 
FDTD  and  CNDG-FDTD  methods  are  listed  in 
Appendices.  The  ADI-FDTD  uses  two  sub-steps  [7] 
(Appendix  I).  The  first  sub-step  advances  time  from  step  n 
to  step  /j+1/2.  The  field  component  is  fully  implicit 

and  requires  solving  a  tridiagonal  matrix.  The  second  sub¬ 
step  advances  time  step  from  «+l/2  to  «+l,  and  the  field 

component  is  fully  implicit  and  also  requires  solving 
a  similar  tridiagonal  matrix.  The  time  step  «+l/2  is 
intermediate  and  the  field  values  at  this  step  are  non¬ 
physical.  The  amplification  factors  for  the  two  sub-steps 
are  [7,1 1] 


1  +  r 


y  ^-ytan-‘((l+r,0(l+r;)-l) 


1  +  r; 


(1) 


^2  = 


tan“'  ((l+r/Xl+r^^  )-l) 


(2) 
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where  j  =  =  cA/  /  2)  /  A^  ,  - 

cA/  sin(^^ 12)1  Ay,  At  is  the  time  step  size,  c  =  1  / -Jfje 
is  the  physical  velocity,  e  and  p  are  the  permittivity  and 
permeability  of  the  material  respectively.  Ax  and  Ay  are 
the  spatial  meshing  sizes  along  x  and  y  axes; 
P^  =  Pco5{<f),  Py=psm{<l>),  P'^=PI+PI  where  p  is 
the  numerical  wave  phase  constant;  and  angle  ^  is  the 

direction  of  travel  with  respect  to  the  x  axis. 

The  CN  scheme  averages  the  field  components  at 
time  step  n  and  n+\  to  maintain  second  order  accuracy  in 
time  [6, 10]  (Appendix  11).  It  has  no  intermediate  time  step. 
But  the  resulting  block  tridiagonal  matrix  is  very  expensive 
to  solve  by  direct  methods  such  as  the  Gaussian 
elimination  or  the  banded  matrix  method,  as  well  as  by 
iterative  methods  such  as  Successive  Over-Relaxation 
(SOR)  or  the  iterative  Altemate-Direction-Implicit  (lADI) 
method  [3].  Since  the  discretization  of  the  Poisson 
Equation  and  heat  equation  also  leads  tosu  ch  a  block 
tridiagonal  matrix,  several  other  methods  to  solve  it  can  be 
found  in  Refs.  [3]  and  [12].  Finjany  et  al.  [6]  solve  the 
block  tridiagonal  matrix  by  eigenvalue/eigenvector 
decomposition. 

Different  solution  methods  for  the  block 
tridiagonal  matrix  may  have  different  amplification  factors 
and  different  numerical  dispersion  relations.  This  paper 
assumes  a  direct  solution  method  of  the  block  tridiagonal 
matrix  such  as  the  Gaussian  elimination  for  the  CN-FDTD 
method.  The  amplification  factor  for  this  CN  method  is 
[10] 


±  j  tan“ 


^CN 


\-rt-rt 


(3) 


Note  that  the  magnitude  of  the  amplification  factor  for  the 
CN  method  is  unity. 

The  CNDG  method  avoids  an  expensive  direct 
solution  (Appendix  III).  It  factorizes  the  block  tridiagonal 
matrix  and  has  two  sub-steps:  the  first  sub-step  finds  the 
intermediate  field  value  H ^ ,  and  the  second  sub-step  gets 
the  solution  at  the  time  step  n+1  [10].  Thus,  like  the  ADI 
method,  the  CNDG  method  needs  to  solve  a  tridiagonal 
matrix  at  each  sub-step.  The  amplification  factor  is 


±  j  tan' 


^CNDG 


(l-r/Xl-r/) 


(4) 


magnitude.  On  the  other  hand,  in  ADI,  the  intermediate 
time  is  («-i-l/2)Ar,  and  each  sub-step  has  its  own 
amplification  factor.  The  magnitudes  of  the  two 
amplification  factors  for  the  two  individual  sub-steps  are 
reciprocals.  Their  product  is  the  magnitude  of  the 
amplification  factor  for  one  fiill  update  cycle,  and  is  unity. 
Note  that  the  unity  magnitude  of  the  amplification  factors 
ensures  that  the  three  implicit  methods  are  unconditionally 
stable  and  strictly  non-dissipative. 

III.  Numerical  Dispersion 

The  numerical  dispersion  relation  can  be  written 
for  the  three  methods  [10,  1 1]  as 


tan^(®A; /2)  =  +  grjrj 

where  the  factor  g  is 

1  ADI 

g  = .  0  CN 

-  tan^(^yA//2)  CNDG 

Given  a  frequency  co  ,  a  mesh  size  Ax  and  Ay  ,  a  time  step 
size  A^,  and  direction  of  travel  (/> ,  Eqn.  (5)  is  solved 
implicitly  for  the  phase  constant  p.  For  a  wave  to 
propagate  p  must  be  real.  The  numerical  dispersion  is 
quantified  by  the  relative  velocity  that  is  defined  as  the 
ratio  of  the  numerical  velocity  to  the  physical  velocity. 


Fig.  1  Numerical  dispersion  with  mesh  density  100  and 
s=l,  5  and  10,  for  ADI,  CN  and  CNDG. 


Though  ADI  and  CNDG  both  have  two  sub-steps,  there  are 
some  differences.  In  CNDG,  the  time  level  of  the 
intermediate  step  is  unknown,  and  there  is  only  one 
amplification  factor  for  the  two  sub-steps,  of  unity 


The  numerical  dispersion  relation  in  Eqn.  (5)  for 
the  ADI  and  CNDG  methods  has  been  validated  with 
numerical  experiments  [10,11].  Fig.l  shows  the  relative 
velocity  as  a  function  of  the  direction  of  travel,  using 
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Ax  =  Ay  with  mesh  density  =  100  cells  per  wavelength, 

at  different  Courant  numbers  s  =  cAt/  Ax  equal  to  1,  5  and 
10.  Note  that  the  numerical  dispersion  curves  of  the  CN 
method  and  the  CNDG  method  are  almost  identical,  and 
are  difficult  to  distinguish  visually  in  Fig.l.  The  relative 
velocity  along  the  axes  is  exactly  the  same  for  the  three 
methods,  but  along  the  diagonal  ADI  is  quite  different 
from  CN  or  CNDG.  Thus  their  numerical  anisotropies  are 
significantly  different,  which  will  be  discussed  in  the  next 
section. 

The  “grid-related  numerical  dispersion”  is  defined 
as  the  dispersion  at  zero  timest  ep  size.  Since  the  cross 
term  gr^ry  in  Eqn.  (6)  goes  to  zero  as  time  step  size 

approaches  zero,  the  grid-related  numerical  dispersion  is 
the  same  for  the  three  methods,  and  is  the  same  as  that  of 
Yee’sFDTD  [2,13]. 

IV.  Numerical  Anisotropy 

The  velocity-anisotropy  error  is  often  used  to 
quantify  the  numerical  anisotropy  [2].  In  an  isotropic 
medium,  the  wavefront  of  a  cylindrical  wave  is  a  circle, 
that  is,  the  phase  velocity  is  the  same  in  all  directions. 
However,  in  the  numerical  domain,  the  numerical  wave 
velocity  usually  depends  on  the  direction  of  travel.  Like 
Yee’s  FDTD  with  a  uniform  mesh,  the  ADI  and  CN 
methods  have  the  largest  velocity  along  the  diagonals  and 
slowest  along  the  axes,  as  shown  in  Fig.  1. 

However,  the  CNDG  behaves  differently  from 
CN  or  ADI.  At  certain  combinations  of  the  mesh  and  time 
step  sizes,  the  velocity  M450  along  the  diagonals  can  be 
slower  than  the  velocity  Uqo  along  the  axes.  To  evaluate 
this  phenomenon,  the  following  definition  of  the  velocity 
anisotropy  is  used  for  a  uniform  mesh 

Au=  xl00%  (j) 

min{M45o,MQo}  ^  ^ 

This  error  definition  has  the  same  magnitude  as  that 
Taflove  and  Hagness  [2],  but  can  be  positive  or  negative, 
depending  on  which  velocity  is  larger.  Fig.  2  and  Fig.  3 
show  the  numerical  anisotropy  at  mesh  densities  from  50  to 
100  cells  per  wavelength  for  the  ADI,  CN  and  CNDG 
methods.  The  anisotropy  of  ADI  is  about  two  orders  of 
magnitude  larger  than  that  of  CN  and  CNDG.  As  the 
Courant  number  increases,  the  anisotropy  in  ADI  increases 
significantly.  The  CN  and  CNDG  have  the  same  behaviour. 
For  the  Courant  number  smaller  than  about  11,  the 
anisotropy  of  CNDG  is  always  smaller  than  that  of  CN;  for 
the  Courant  number  larger  than  11,  the  anisotropy  of 
CNDG  is  larger  than  that  of  CN  at  coarse  mesh  density  (for 
example,  50),  but  it  quickly  becomes  smaller  than  that  of 
CN  after  certain  mesh  densities.  For  instance,  if  the  Courant 
number  is  12,  at  the  mesh  density  50,  the  numerical 


anisotropy  is  -0.148*0.01  for  CNDG  and  0.082*0.01  for 
CN;  at  mesh  density  55,  their  absolute  anisotropy  values 
are  the  same,  about  0.066*0.01.  However,  at  mesh  density 
60,  CN  has  0.055*0.01  but  CNDG  has  only  0.038*0.01. 


50  60  70  B0  ^10  100 


Mesh  Density  {  flOI  1 

Fig.  2  Numerical  anisotropy  of  the  ADI  method  at  Courant 
numbers  1,5  and  10. 


50  60  70  80  <10  100 


Mesh  Density  (CN  ^  CNDGl 

Fig.  3  Numerical  anisotropy  for  CN  and  CNDG. 

Notice  that  the  anisotropy  changes  its  sign  for 
CNDG  at  certain  combinations  of  the  mesh  density  and 
Courant  number.  This  suggests  that  at  certain  Courant 
numbers,  a  mesh  may  have  no  anisotropy.  Fig,  4  shows  the 
relation  between  the  Courant  number  and  mesh  density 
with  zero  anisotropy.  Note  that  neither  ADI  nor  CN  has 
this  behavior. 

V.  Time  Step  Size  Limitations 

In  the  numerical  dispersion  relation  of  Eqn.  (5),  if 
the  tangent  is  infinite,  the  phase  constant  becomes  complex 
for  all  the  three  methods.  This  limit  is  reached  when 


(8) 
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Fig,  4  The  relation  of  the  Courant  number  and  mesh 
density  at  zero  anisotropy  for  CNDG. 


Fig.  5  Time  step  size  limits  with  respect  to  mesh  density  for 
the  three  methods. 


Fig.  6  Intrinsic  temporal  anisotropy  with  zero  mesh  size 
for  ADI. 


Fig.  7  Intrinsic  temporal  anisotropy  with  zero  mesh  size  for 
CNDG. 


Eqn.  (8)  is  recognized  as  the  Nyquist  limit  for  the  time 
sampling.  For  a  given  mesh  density  N,  the  Nyquist  limit 
can  be  written  in  relation  to  the  Courant  number  as 


sin 


l  +  -\Jl-ta.n*{o)At /2) 
(5  tan(^yA//2))^ 


(10) 


However,  Eqn.  (5)  implies  a  smaller  time  step  size  limit 
than  the  Nyquist  limit.  For  example,  along  the  x-axis, 
sin(/?Ax/2)  must  be  smaller  than  or  equal  to  one  if  P  is  to 
be  real. For  the  ADI  and  CN  methods,  the  minimum 
velocity  always  occurs  along  the  axes,  so  the  limiting 
Courant  number  for  a  give  mesh  density  can  be  found  from 
Eqn.  (5)  to  satisfy  tan(;zs ! N)-s  .  For  the  CNDG  method, 
at  larger  Courant  numbers,  the  minimum  velocity  is  along 
the  diagonals.  For  Ax  =  Ay,  Eqn.  (5)  for  the  CNDG 

method  leads  to 


where  P^^^  is  the  phase  constant  along  the  diagonals.  If 
tan(£yAr  /  2)  >  1 ,  then  the  phase  constant  p  becomes  a 
complex  number.  Thus  tan(iyA/  /  2)  =  1  is  the  limit  for  the 

CNDG  method.  This  corresponds  to  .y  =  ?//4  .  The  three 
curves  shown  in  Fig.  5  are  the  Nyquist  limit,  and  the  step 
size  limits  for  the  ADI  and  CN  methods  and  for  the  CNDG 
method.  If  a  method  uses  a  Courant  number  larger  than  its 
limit  shown  in  Fig.  5,  numerical  attenuation  will  occur, 
which  does  not  correspond  to  the  physical  reality, 

VI.  Intrinsic  Temporal  Dispersion 

In  Yee’s  FDTD,  as  the  mesh  density  increases,  the 
numerical  dispersion  decreases.  In  the  limit  of  an 
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infmitely-fine  mesh,  that  is,  zero  spatial  mesh  size,  there  is 
no  numerical  dispersion  because  the  Courant  limit  forces 
the  time  step  size  to  be  zero.  Hence,  Yee’s  method 
collapses  to  the  continuous  case,  that  is,  no  discretization 
for  time  and  space.  But  for  the  unconditionally-stable 
implicit  methods,  because  there  is  no  Courant  limit,  fine 
mesh  size  can  be  accompanied  by  very  large  time  step  size 
and  the  method  remains  stable.  Even  in  the  limit  of  zero 
spatial  mesh  size,  the  methods  are  still  stable  for  any  time 
step  size,  but  there  is  numerical  dispersion.  The  numerical 
dispersion  at  zero  mesh  size  is  an  intrinsic  limitation  and 
may  be  termed  “intrinsic  temporal  dispersion”,  previously 
called  “time-splitting-related  dispersion”  [11].  The 
intrinsic  temporal  dispersion  is  different  for  the  three 
methods.  With  some  manipulation,  it  can  be  written  as 


1  +  y]\  +  g(tan(6)At  /  2)  sin  2^)^ 

2  (11) 


method  is  two  orders-of-magnitude  larger  than  that  of  CN 
and  CNDG.  In  the  limit  of  zero  mesh  size,  ADI  and  CNDG 
have  anisotropy  but  the  CN  method  does  not.  Different 
from  the  ADI  and  CN  methods,  the  CNDG  method  may 
have  slower  velocity  along  the  diagonals  than  along  the 
axes,  and  may  have  zero  anisotropy  at  certain  combinations 
of  Courant  number  and  mesh  density.  The  three  methods 
have  time  step  size  limits  that  are  smaller  than  the  Nyquist 
criterion.  CNDG  has  the  smallest  time  step  limit.  The 
intrinsic  temporal  dispersion  is  a  fundamental  accuracy 
limitfor  the  three  methods  and  is  much  larger  for  ADI  than 
for  CN  or  CNDG  along  the  diagonals. 

APPENDIX  I  Update  Equations  for  ADI-FDTD 

The  ADI-FDTD  has  two  sub-steps.  The  first  sub¬ 
step  is  advancing  time  from  step  n  to  step  «+l/2  by  use  of 
the  following  update  equations  [7] 


u  _  coMU 
c  tan(o)At/2) 


Numerical  calculations  using  Eqn.  (11)  show  that  the 
intrinsic  temporal  dispersion  as  a  function  of  the  direction 
of  travel  is  similar  to  that  shown  in  Fig.  1  but  smaller  in 
magnitude.  From  Eqn.  (1 1)  it  can  be  seen  that  the  relative 
velocity  is  not  a  function  of  direction  of  travel  for  the  CN 
method.  Therefore  CN’s  anisotropy  is  zero  at  the  zero 
mesh  size  limit.  But  for  ADI  and  CNDG,  there  is 
anisotropy,  as  shown  in  Fig.  6  and  Fig.  7.  This  anisotropy 
is  termed  the  “intrinsic  temporal  anisotropy.”  Note  that  the 
ADI’s  anisotropy  is  about  30  times  larger  than  that  of 
CNDG  at  the  time  step  size  of  one-tenth  Nyquist  time  step 
size  limit. 

Note  that  in  Eqn.  (1 1)  the  tangent  cannot  be  larger 
than  unity  for  CNDG  method;  otherwise  the  velocity  will 
be  a  complex  number.  This  time  step  size  limit 
corresponds  to 


Ar 


CNDG~ 


2  2/ 


(12) 


This  is  half  of  the  Nyquist  limit,  and  coincides  with  that  in 
Section  V  for  the  time  step  size  limit  of  the  CNDG 
method.  For  non-zero  mesh  size,  the  numerical  velocity  is 
always  smaller  than  the  intrinsic  temporal  dispersion, 
therefore  Eqn.  (1 1)  is  a  fundamental  accuracy  limit  for  the 
three  methods. 


VII.  Conclusion 

This  paper  has  discussed  several  aspects  of  the 
ADI,  CN  and  CNDG  methods.  The  magnitude  of  the 
overall  amplification  factor  for  all  three  methods  is  unity; 
hence  they  are  all  unconditionally  stable.  The  numerical 
dispersion  is  the  same  along  the  axes  for  the  three  methods, 
but  differs  along  the  diagonals.  The  anisotropy  in  the  ADI 


+  1/2,  J)  =  E;(i  + 1/2,  j)  +  a,{ 

H"(i  + 1/2,7  + 1/2)  -  + 1/2,7  -1/2)}/ Ay  (H) 

j  + 1/2)  =  E;(i,j  + 1/2)  -a,{ 

+ 1/2,  J  +  \/2)-H"*''\i  - 1/2,7  + 1/2)}/ Ax  (1-2) 

//;^'^^(/  + 1/2,7 +  1/2)  =  //;(/ +  1/2,7  + 1/2) +  a2{ 

£■;(/  + 1/2,7  + 1)  -  Kii  +  i/2,7)}/Ay 
-a^[Ef^{i  +  \,j  +  \/2)-Ef'\i,j  +  \/2)}/Ax 

(1-3) 

where  =  At !  2s ,  a2  =  A/  /  2// ,  A/  is  the  time  step  size; 
€  and//  are  the  permittivity  and  permeability  of  the 
material  respectively;  Ax  and  Ay  are  the  spatial  meshing 
sizes  along  x  and  y  axes;  i  and  j  are  the  integer-number 
indices  of  the  computational  cells;  and  n  is  the  time  step 
index.  In  this  step,  is  implicit  and  can  be  found  by 

solving  a  tridiagonal  matrix  of  the  form 

E;*''^ (i  - 1,7  + 1  / 2)  -  {(Ax^  / a, a,  +  2)E";''^ (i, 7  +  1/2) 

+  E"/'''^{i  + 1, 7  + 1  /  2)  =  -(Ax^  /  a.flj  )E;  (i,  7  + 1  /  2)  + 
(Ax/aj){//;0  +  l/2,7  +  l/2)-//;(i-l/2,7  +  l/2)} 

+  Ax/Ay{E"^  (i  + 1  /  2, 7  + 1)  -  £;  (i  + 1  /  2, 7)  + 
£;(/-l/2,7)-£;(i-l/2,7  +  l)} 

(1-4) 

The  second  sub-step  advances  time  step  from  n+1/2  to  w+l 
and  the  update  equations  are 
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(/  + 1  /  2,  j)  =  £r' '  ^  (/•  +  1  /  2,  y)  +  a,  { 

H”*\i  +  \ll,j  +  \l2)- //r' O'  +  1  / 2. y  - 1  / 2) } / A>;  (1.5) 

+ 1/2)  =  + 1/2)  -a,{ 

//f''^0  +  l/2.y+l/2)-//r"^(/-l/2,y  +  l/2)}/Ax  (1-6) 

//f'('  +  l/2,y  +  l/2)  =  //r''^('  +  l/2,y  +  l/2)  +  a2{ 

'(1  +  l/2,y  +  l)-£r'(''  +  l/2.y)}/Ay 
-a2{£;"'(/  +  l,y  +  l/2)-£;^'((.y  +  l/2)}/Ax  (1-7) 

Since  £■"■"'  is  implicit  and  it  is  found  by  solving  a 
tridiagonal  matrix  of  the  form 

£;"'(, •  + 1/2, y  - 1)  -  {A/  /a,n2  +  2)£r'0'  +  l/2.y)  + 

(,•  + 1  /  2,  y  + 1)  =  -KAf'  /nia2  )ET'  ' '  O'  + 1  /  2,  y)  + 
(Ay/n2){77r’''('  +  l/2.y-l/2)- 

+  Ay/ Ax{£;*''^((  +  l,y  + 1  /2)  -  +  l/2)  + 

£;"" "  (/  + 1,  y  - 1  /  2)  -  "  (I.  y  - 1  /  2) }  (1-8) 

APPENDIX  II  Update  Equations  for  CN-FDTD 

The  CN  scheme  averages  the  field  components  at 
time  step  n  and  «+l  to  maintain  second  order  accuracy  in 
time  as  follows  [6,  10] 

4'"'(/  +  l/2,y)=^(i  +  l/2,y)  + 
a^{ir;\i  +  l/2,y  + 1/2)  -  ff/\i  +  l/2,y  - 1/2)  + 
tr^ii  +  V2,J  + 1/2)  -/^(i  +  l/2,j- 1/2)}/ Ay 

0',y+l/2)  =  £;(/,y+l/2)- 
a{  //f  ('+l/2y+i/2)-//r'('-l/2,y+l/2)+ 
//;(i+l/2,y+l/2)-//;(i-l/2,y+l/2)}/Av 

//;^'(i  +  l/2,y  +  1/2)  =  //;((  +  1/2,  y  + 1/2)  + 

02  {£;*'  (/  +  1  /  2,  y  +  1)  -  E"/\i  +  1  /  2,  y )  + 

£;(/  +  l/2,y  +  1)  -  £■;(/  +  l/2,y)}/ Ay  - 
(r  +  1,  y  +  1  /  2)  -  £;^'  (/,  y  +  1  /  2)  + 

£;(/  +  l,y  +  l/2)-£;(i,y  +  l/2)}/Ax 

The  block  tridiagonal  matrix  to  be  solved  for  the 
CN  method  is 


(II- 1) 


(II-2) 


(II-3) 


//^^'((+l/2,y+l/2)-fl,a2{//^^'('+3/2,y+l/2)+ 
/^‘((-l/2,y+l/2)-2W^"^'(/+l/2,y+l/2}/At^ 
-a,a2{//^^'0+l/2.y+3/2)+/^^'(/+l/2.y-l/2)- 
2/^'"'(i+l/2,y+l/2)}/A/  = 
//^(i+l/2,y+l/2)+0|a2{//^(/+3/2,y+l/2)+ 

//^(/- l/2,y+l/2)-2/^  (i + l/2,y + 1/2)}/A»7 
+a,a2{//^(/+l/2,y+3/2)+//^(/+l/2,y-l/2)- 

2H^(i+V2,j+V2)}/A/ 

+2a2{[Er,{i+y2,j+\)-Er^(,i+Vlj)/Ay-]- 

[E;ii+ij+\/2)-E;(ij+u2)yAK} 

APPENDIX  III  Update  Equations  for  CNDG~FDTD 

The  update  equations  of  CNDG-FDTD  are  the 
same  as  the  CN  method.  The  difference  between  the  CN- 
FDTD  and  CNDG-FDTD  is  the  method  used  to  solve  the 
block  tridiagonal  matrix.  In  CNDG-FDTD,  it  is  factorized 
into  two  sub-steps  [10] 

Hlii-l/2,j+l/2)-2Hl(i+l/2,J+l/2)}/Ax^ 

=  H"(i+l/2,j+l/2)+a,a2[H"(i+2/2,j+l/2)+ 

//;(z-l/2,y+l/2)-2//;(/+l/2,y+l/2)}/Ax' 

+a,a2{/7;(/+l/2,y+3/2)+//;(/+l/2,y-l/2)- 

2//;(/+l/2,y+l/2)}/A/ 

+2a2{[v5;(/+l/2,y+l)-£;(/+l/2,y)]/Ay- 

[£■;  (i + 1,  y + 1  /  2)  -  E;  (ij + 1  /  2)]/  Ax}  (HM ) 

//f  (/  +  l/2,y  + 1/2)  -  a|02{7/r'(/  +  l/2,y  +  3/2)  + 

//f  (/  + 1/2, y  - 1/2) - 2H",^'(i  +  l/2,y  + 1/2)}/ a/ 

=  //*(/  +  l/2,y  +  l/2) 

At  each  sub-step  a  tridiagonal  matrix  must  be 
solved.  It  can  be  seen  that  the  ADI  method  solves  the 
electric  fields  implicitly,  but  the  CN  and  CNDG  methods 
solve  the  magnetic  field  implicitly  in  the  2D  TE^cas  e. 
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Abstract 

Electromagnetic  research  often  requires  studies 
within  wider  frequency  ranges.  For  achieving  a 
fine  resolution  in  the  frequency  domain,  the 
required  computation  time  is  usually  high.  Here 
the  MoM-based  field  computation  program 
FEKO  working  in  frequency  domain  is  used  for 
this  purpose.  In  order  to  reduce  the  computa¬ 
tional  costs  by  minimizing  the  number  of  sam¬ 
pling  points  used,  the  interpolation  algorithm 
MPBE  (Model  Based  Parameter  Estimation)  is 
applied  to  achieve  a  mathematically  based  ap¬ 
proximation  of  the  problem.  This  paper  pre¬ 
sents  the  acceleration  of  computations  with 
FEKO  using  the  interpolation  algorithm 
MBPE.  A  short  introduction  to  FEKO  is  given 
at  the  beginning.  Subsequently  the  implementa¬ 
tion  of  MBPE  as  well  as  three  possible  adap¬ 
tive  strategies  for  further  shortening  the  compu¬ 
tation  time  is  presented.  Finally  examples  are 
given  that  show  the  advantages  of  this  imple¬ 
mented  method. 

Introduction 

Research  in  the  field  of  electromagnetic  com¬ 
patibility  often  requires  studies  over  large  fre¬ 
quency  ranges.  Since  the  scenarios  to  be  mod¬ 
eled  are  very  complex  (e.g.  in  medical  areas) 
and  also  the  computation  time  needed  for  these 
kinds  of  investigations  is  usually  very  high  a 
new  approach  has  to  be  developed  to  fulfill  all 
these  requirements.  In  this  paper  the  MoM- 
based  field  computation  program  FEKO  is  ac¬ 


celerated  using  the  MBPE  algorithm,  which  is 
presented  in  the  following. 

The  Field  Computation  Program  FEKO 

FEKO  [1],  [2]  is  a  field  computation  program 
considering  objects  of  arbitrary  shape.  It  is 
based  on  a  full  wave  solution  of  Maxwell’s 
equations  in  the  frequency  domain.  The  accu¬ 
rate  Method  of  Moments  (MoM)  formulation  is 
used  to  solve  for  the  unknown  surface  currents. 
Asymptotic  techniques.  Physical  Optics  (PO) 
and  Uniform  Theory  of  Diffraction  (UTD)  have 
been  hybridized  with  the  MoM  in  order  to 
solve  electrically  large  problems.  The  MoM 
has  also  been  extended  to  solve  problems  in¬ 
volving  multiple  homogeneous  dielectric  bod¬ 
ies,  thin  dielectric  sheets  and  dielectric  coated 
wires.  An  approach  is  necessary  to  speed  up 
computations  over  large  frequency  ranges. 
Therefore,  the  acceleration  of  FEKO  with  the 
interpolation  algorithm  MBPE  is  described  in 
the  following. 

Model  Based  Parameter  Estimation  (MBPE) 

In  order  to  reduce  the  computational  costs  by 
minimizing  the  number  of  sampling  points 
used,  the  interpolation  algorithm  MPBE  [3], 
[4],  [5],  [6]  is  applied  to  achieve  a  mathemati¬ 
cally  based  approximation  of  the  problems.  A 
function  sampled  in  the  frequency  domain  is 
approximated  with  MBPE  to  represent  the 
original  function  by  a  reduced-order  physically 
based  approximation  called  a  fitting  model. 
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The  application  of  such  a  fitting  model  means 
interpoiating  between  samples  to  reduce  the 
amount  of  data  needed.  First  frequency  /  is 
normalized  with  respect  to  the  center  frequency 
/,  and  the  width  of  the  frequency  interval  . 


x  = 


f-L 

L 


(1) 


Now  the  fitting  models  are  described  by  a  pole 
series  based  on  the  Fade  approximation  with 
rational  polynomials 


^(/)  = 


^(/) 

I  A./” 

\-0 


(2) 


with  n  is  order  of  the  numerator  and  d  is  or¬ 
der  of  the  denominator.  Basically  there  are 
three  possibilities  for  determining  parameters 
for  fitting  models,  either  from  samples  of  the 
process  to  be  modeled,  alternatively  from  sam¬ 
ples  of  the  derivatives  of  the  process,  or  from  a 
combination  of  both.  Since  the  field  computa¬ 
tion  program  FEKO  is  used  here  and  the  com¬ 
putational  cost  of  derivative  sampling  of  a 
process  would  be  much  higher,  only  function 
sampling  is  used  in  this  paper.  With  sampling 
in  frequency  domain,  there  is  no  need  for  the 
samples  (/)  to  be  uniformly  spaced  as  is  usu¬ 
ally  the  case  in  most  time  domain  solutions. 
That  means  that  the  samples  can  be  chosen  in 
such  a  way  that  the  yield  of  information  for 
each  sample  is  a  maximum.  Equation  (2)  leads 
to 


o(/)F(y:)  =  Ar(y;).  i  =  o....,D-7  (3a) 

where  D{f,)  =  D„+D,f,+DJ,^  +...+Djf  (3b) 

and  N{f,)  =  N,+N,f,+NJ^  +  (3c) 

There  are  D  =  d  +  n  +  2  unknown  coefficients  in 
the  two  polynomials  D(/)  and  N{fi).  An  ad¬ 
ditional  condition  or  constraint  is  needed  and 


so  Dj  =  1  is  chosen  (linear  predictor  constraint). 
With  now  determining  the  D  =  d+n  +  l  un¬ 
known  coefficients  Ng,N,,N2+...  +  N„,D„.D,, 
Di+.-.+Dj,,  of  this  equation  system  a  function 
to  represent  the  original  values  can  be  found. 
Now  the  complete  frequency  range  is  divided 
into  different  windows  and  for  every  window 
an  equation  system  according  to  (3)  is  solved. 
In  the  end  all  fitting  models  are  combined  to 
give  the  overall  solution  function  for  the  entire 
frequency  range. 

A  first  example  (IEEE  German  EMC  chapter: 
benchmark  problem  no.  3)  for  the  functionality 
of  the  algorithms  is  given  with  a  monopole 
antenna  on  an  infinite  ground  plane  and  a 
nearby  wire  loop  (see  figure  1).  The  exciting 
frequency  of  the  antenna  is  varied  in  the  range 
from  /  =  1  MHz  to  30  MHz.  The  resulting 
complex  current  induced  in  the  wire  loop  is 
shown  in  figure  2. 


Fig.  1.  Geometry  in  principle  for  the  bench¬ 
mark  problem  no.  3  defined  by  the 
German  lEEE/EMC  chapter. 

If  30  equidistant  sample  points  (i.e.  sampling 
every  1  MHz)  are  linearly  interpolated  the 
curve  shown  in  figure  2a  results.  In  this  case 
resonant  peaks  are  cut  off.  An  exact  result  can 
be  achieved  using  uniformly  and  closely  sam¬ 
pled  points  (in  this  case  581  samples,  i.e.  sam¬ 
pling  every  50  kHz)  as  shown  in  figure  2b.  The 
same  curve  results  with  using  the  30  sample 
points  and  interpolating  with  MBPE  (see  figure 
2c). 

Overlapping  fitting  models  always  containing 
+1  =  7  sample  points  were  used  to  ap¬ 
proximate  this  problem.  The  orders  of  the 
polynomials  for  every  fitting  model  were  cho¬ 
sen  Nf,=Nij  =  3.  The  first  seven  sample  points 
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were  used  for  determining  the  coefficients  of 
the  first  model  representing  the  first  part  of  the 
solution.  For  the  second  fitting  model  samples 
2  to  8  are  used  to  compute  its  model  coeffi¬ 
cients  and  so  on.  In  the  end  all  fitting  models 
were  set  together  to  achieve  the  overall  solution 
with  only  30  sample  points.  The  computation 
time  rises  linearly  with  the  number  of  sample 
points  used  in  total,  so  in  this  case  the  overall 
computation  was  accelerated  with  using  the 
MBPE  algorithm  by  an  acceleration  factor  of 

^  =  19.4. 

30 


Fig.  2.  Complex  current  induced  in  a  wire 
loop. 


Adaptive  Sampling 

With  “adaptive  sampling”  a  more  flexible  algo¬ 
rithm  is  applied  to  fiirther  minimize  the  number 
of  required  sample  points  by  exploiting  the  fact 


that  in  the  frequency  domain  no  uniformly 
spaced  samples  are  required.  The  computation 
is  started  with  a  small  set  of  sampling  points 
and  with  these  a  set  of  overlapping  windows  is 
determined.  The  whole  frequency  range  is  cov¬ 
ered  by  overlapping  windows  so  that  a  maxi¬ 
mum  error  between  two  fitting  models 


E. 


(4) 


can  be  determined  for  every  frequency.  Where 
the  error  is  a  maximum,  a  new  sample  point 
will  be  added.  If  a  window  is  containing  too 
much  sample  points,  it  will  be  split  up  into  two. 
The  algorithm  is  terminated  when  the  error 
falls  under  a  certain  threshold. 

The  second  adaptive  sampling  approach  im¬ 
plemented  is  using  a  strategy  based  on  Rom¬ 
berg’s  method.  The  procedure  starts  with  an 
interval  containing  five  uniformly  spaced  sam¬ 
ples.  With  these  samples  three  trapezoidal  val¬ 
ues  are  computed  (with  samples  1,3,5;  1,2,3 
and  3,4,5)  and  an  error  estimation  between 
these  values  indicates  whether  a  new  sample  is 
required  in  this  interval.  If  that  is  the  case,  two 
new  subintervals  are  formed  from  each  half  of 
the  original  interval  and  Romberg’s  method  is 
applied  to  both  of  them.  If  new  samples  are 
required  in  either  of  these  subintervals,  new 
sample  points  are  added  where  indicated.  This 
process  is  repeated  until  the  error  falls  under  a 
chosen  threshold. 

The  third  approach  to  reduce  the  number  of 
sampling  points  needed  is  a  Genetic  Algorithm 
(GA)  to  find  the  orders  of  the  numerator  and 
the  denominator  for  a  fitting  model  represent¬ 
ing  the  whole  frequency  range.  It  is  started  with 
a  very  simple  model  with  low  order  (e.g. 

+1  =  4).  There  are  now  various  possi¬ 
bilities  for  the  orders  of  the  numerator  and  the 
denominator  (  Af„  =  0,Wo  =  3  /  N^  =  \:Na  =  2 
/...).  If  the  three  best  models  of  this  generation 
are  not  sufficient  in  representing  the  final  re¬ 
sult,  a  new  sample  point  is  added  and  a  new  set 
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of  models  (with  N^+No+\  =  5)  will  be  deter¬ 
mined.  In  this  way  the  number  of  sampling 
points  is  consequently  increased  during  the 
steps  of  the  approximation  process  until  the 
error  between  the  three  best  models  falls  under 
a  given  threshold. 

Results 

All  three  adaptive  strategies  are  now  used  to 
achieve  an  accurate  result  with  less  sample 
points.  For  the  benchmark  problem  described 
above  the  three  strategies  lead  to  the  following 
results.  The  first  adaptive  approach  needed  24 
samples  and  achieved  an  accuracy  of  0.01 
while  using  four  overlapping  windows  (win¬ 
dow  orders:  Nf,o  =  4; =  3  /  N^,  =  5;Nc,  =  5  / 
Nf,2  =  =  8  and  Nfij  =  =6  arranged  in 

such  a  way,  that  always  two  windows  are  over¬ 
lapping).  With  the  second  approach  at  first  33 
samples  were  computed  according  to  Rom¬ 
berg’s  method  and  then  windows  with  13  sam¬ 
ples  per  window  (N^=6;Np  =  6)  always  over¬ 
lapping  from  sample  10  to  13  are  used  to  get 
the  overall  function.  Finally  an  accuracy  of 
6.5  10’  is  achieved.  With  using  the  Genetic 
Algorithm  30  samples  were  computed  and  the 
achieved  accuracy  is  5  10  ’ .  The  resulting  three 
best  approximation  functions  have  the  orders 
=  =  =  15,W„,  =  14  and 

Af^j  =  13,W^j  =  16. 

In  the  next  example  [7]  there  is  a  small  wire 
loop  in  a  cube  shaped  metallic  housing  radiat¬ 
ing  an  electromagnetic  field  (see  figure  3).  The 
shielding  effectiveness  (the  electrical  field  ra¬ 
diated  through  the  housing’s  fi'ont  plate  with 
four  small  slots  in  it  compared  to  the  electrical 
field  without  front  plate)  is  shown  in  figure  4. 
Here  only  84  samples  were  used  and  interpo¬ 
lated  with  MBPE  (GA  approach)  compared  to 
901  for  a  closely  and  uniformly  sampled  func¬ 
tion.  Therefor  five  windows  were  computed 
(100  MHz  -  309  MHz:  =  6 , 

309  MHz  -  418  MHz:  Af^,  =  3,Wo,  =  3 , 

418  MHz  -  550  MHz:  A^^,  =  10,Woj  =  13, 


550  MHz  -  760  MHz:  N^j  =  l2;Niyj=9 

760MHz  -1000 MHz,  =  \2;No,  =  S)  to 
achieve  the  overall  solution  function.  There  is 
almost  no  difference  between  the  two  function'! 
(the  maximum  error  over  the  whole  frequency 
range  in  this  case  is  10'^)  and  the  factor  in  time 
saving  (acceleration  factor)  is  901/84  =  10.7. 


Fig.  3.  Housing  (without  (left)  and  with  (right) 
front  plate)  containing  a  small  wire 
loop  radiating  an  electromagnetic  field. 


Shielding  effectiveness  in  dB 
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Fig.  4  Shielding  effectiveness  of  the  housing  in 
dB. 

One  other  examples  containing  also  very  sharp 
peaks  is  a  simple  dipole  forked  at  both  ends 
excited  by  an  incoming  plane  wave.  The  result¬ 
ing  magnitude  of  the  input  impedance  at  its 
ports  is  depicted  over  frequency  in  figure  5. 
Here  only  1 1  sample  points  (also  shown  in  the 
figure)  were  necessary  to  interpolate  this  curve 
with  an  accuracy  of  5.5  10’  compared  to  1500 
uniformly  and  closely  sampled  points.  This 
means  an  acceleration  factor  of  1000/11=90.9. 
One  window  is  used  to  approximate  the  curve 
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with  the  orders  Nc  =  6  and  =  4 .  With  a  little 
lower  accuracy  this  curve  could  be  interpolated 
with  only  10  points,  which  results  in  an  accel¬ 
eration  factor  of  100. 


.  IMPEDAWCC  (tXM) 


FREOUENCYIKfl  X 10* 


Fig.  5  Input  impedance  of  a  forked  dipole  an¬ 
tenna  excited  by  an  incoming  plane 
wave. 

Finally  it  can  be  said,  that  with  this  new  ap¬ 
proach  the  computation  time  has  been  much 
reduced  and  therewith  parameter  studies  and 
optimizations  for  larger  problems  needing  a  lot 
of  single  computations  can  be  done  now  with 
the  help  of  this  method  in  an  acceptable  time. 

Conclusions 

The  successful  implementation  of  Model  Based 
Parameter  Estimation  and  coupling  with  the 
field  computation  program  FEKO  is  presented 
here.  Adaptive  sampling  for  speeding  up 
computations  is  also  described.  The 
implemented  algorithm  converges  and  shows 
good  accuracy  in  first  examples,  which 
underline  the  advantages  of  the  algorithm. 
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Abstract 

In  solving  systems  of  linear  equations  arising 
from  practical  engineering  models  such  as  the 
electromagnetic  wave  scattering  problems,  it  is 
critical  to  choose  a  fast  and  robust  solver.  Due 
to  the  large  scale  of  those  problems,  precondi¬ 
tioned  Krylov  iterative  methods  are  most  suit¬ 
able.  The  Krylov  iterative  methods  require  the 
computation  of  matrix-vector  product  opera¬ 
tions  at  each  iteration,  which  account  for  the 
major  computational  cost  of  this  class  of  meth¬ 
ods.  We  use  the  multilevel  fast  multipole  algo¬ 
rithm  (MLFMA)  to  reduce  the  computational 
complexity  of  the  matrix-vector  product  oper¬ 
ations.  We  conduct  an  experimental  study  on 
the  behavior  of  three  Krylov  iterative  methods, 
BiCG,  BiCGSTAB,  and  TFQMR,  and  of  two 
preconditioners,  the  ILUT  preconditioner,  and 
the  sparse  approximate  inverse  (SAI)  precon¬ 
ditioner.  The  preconditioners  are  constructed 
by  using  the  near  part  matrix  numerically  gen¬ 
erated  in  the  MLFMA.  Our  experimental  re¬ 
sults  indicate  that  a  well  chosen  preconditioned 
Krylov  iterative  method  maintains  the  computa¬ 
tional  complexity  of  the  MLFMA  and  effectively 
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reduces  the  overall  simulation  time. 

1  Introduction 

The  hybrid  integral  equation  approach  combines 
the  volume  integral  equation  and  the  surface  in¬ 
tegral  equation  to  model  the  scattering  and  radi¬ 
ation  by  mixed  dielectric  and  conducting  struc¬ 
tures  [12].  For  example,  when  a  radome  is  ap¬ 
plied  to  an  antenna,  the  combined  system  con¬ 
sists  of  both  dielectrics  and  conductors.  Hence, 
the  hybrid  surface-volume  integral  equation  is 
ideal  for  this  problem  [3].  The  volume  integral 
equation  is  applied  to  the  material  region  (V^) 
and  the  surface  integral  equation  is  enforced  over 
the  conducting  surface  (5).  The  hybrid  surface- 
volume  integral  equations  for  electromagnetic 
scattering  problems  can  be  formally  written  as 
follows, 

{Ls(r,r')  •  +  Ly(r,r')  •  Jv(7’')}tan 

= -Elixir),  res, 

~E  -f  Ls{r,r')  ♦  Js(r')  -h  Lv(r,r')  •  Jv{r^) 

=  r  e  y, 

where  E^^^  stands  for  the  excitation  field  pro¬ 
duced  by  an  instant  radar,  the  subscript  “tan” 
stands  for  taking  the  tangent  component  from 
the  vector  it  applies  to,  and  LQ,(n  =  5,  Y),  is 
an  integral  operator  that  maps  the  source  Jq  to 
electric  field  E{r)  and  it  is  defined  as 

Ln(r,r')  -  Jnir')  = 
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ium  f  {I  +  G{r,  r')  ■  Mr')  dO'. 

Jq> 

Here  G(r,r')  =  -  r'|)  is  the  3D 

scalar  Green’s  function  for  the  background  me¬ 
dia,  and  i  =  It  should  be  pointed  out  that 
E  is  related  to  Jy  in  the  above  integral  equations 
by  Jy  =  icj{eb  —  e)E. 

We  follow  the  general  steps  of  the  method 
of  moments  (MoM)  [17]  to  discretize  the  hybrid 
surface- volume  integral  equations,  and  solve 
the  resultant  matrix  equation  by  a  multilevel 
fast  multipole  algorithm  (MLFMA),  which  is  a 
multilevel  implementation  of  the  fast  multipole 
method  (FMM).  A  matrix  equation  is  given  as 
the  form 


where  and  aX  stand  for  the  vectors  of  the 
expansion  coefficients  for  the  surface  current  and 
the  volume  function,  respectively  [3, 12],  and  the 
matrix  elements  can  be  generally  written  as 


•  /  dQ'{I  +  K^VV)G{r,  r')  ■  xir')fP'  ■ 

Jn' 

The  material  function  =  1  if  D'  is  a  surface 
patch,  and  x  —  (c/^6  “  1)  is  a  volume  cell. 
It  can  be  seen  that  the  coefficient  matrix  aris¬ 
ing  from  discretized  hybrid  integral  equations  is 
nonsymmetric.  Once  the  matrix  equation  (1) 
is  solved  by  numerical  matrix  equation  solvers, 
the  expansion  coefficients  and  can  be  used 
to  calculate  the  scattered  field  and  radar  cross 
section.  In  antenna  analysis  problems  the  co¬ 
efficients  can  be  used  to  retrieve  the  antenna’s 
input  impedance  and  calculate  the  antenna’s  ra¬ 
diation  pattern.  In  the  following,  we  use  A  to 
denote  the  coefficient  matrix  in  Equation  (1), 
X  =  and  b  —  [11^,11^]'^  for  simplicity. 

The  basic  idea  of  the  FMM  is  to  convert 
the  interaction  of  element-to-element  to  the  in¬ 
teraction  of  group-to-group.  Using  the  addition 
theorem  for  the  free-space  scalar  Green’s  func¬ 
tion,  the  matrix-vector  product  Ax  can  be  writ¬ 
ten  as 


Ax  =  (Ad  +  An)x  +  VfAVsX, 


where  V/,  A,  and  Vg  are  sparse  matrices.  We  de¬ 
note  Bn  =  (Ad^An)  for  simplicity.  The  FMM 


speeds  up  the  matrix-vector  product  operations 
and  reduces  the  computational  complexity  of  a 
matrix-vector  product  from  0{N^)  to 
where  N  is  the  order  of  the  matrix  [4].  The 
computational  complexity  is  further  reduced  to 
0{N  log  N)  with  the  multilevel  fast  multipole 
algorithm  (MLFMA)  [3].  As  the  level  of  the 
MLFMA  decreases,  we  find  that  the  number  of 
nonzeros  in  the  near  part  matrix  Bn  increases 
significantly.  The  accuracy  of  the  computed  so¬ 
lution  is  strongly  related  to  the  number  of  levels 
of  the  MLFMA  [11].  That  is,  as  the  number 
of  the  MLFMA  levels  decreases,  the  computed 
solution  is  close  to  the  exact  solution,  but  the 
near  part  matrix  Bn  becomes  denser.  It  is  well- 
known  that  the  MLFMA  is  an  approximation 
method. 

A  matrix  problem  involving  N  unknowns 
may  be  solved  in  floating  point  op¬ 

erations,  where  (7  is  a  constant  depending  on  the 
implementation  of  a  particular  iterative  method 
[2,  4,  5,  15],  and  is  the  floating  point  op¬ 
erations  needed  for  each  matrix- vector  multipli¬ 
cation.  For  many  realistic  problems,  de¬ 
pends  on  both  the  iterative  solver  and  the  tar¬ 
get  properties  (shape  and  material).  For  exam¬ 
ple,  a  problem  with  an  open-ended  cavity  needs 
much  more  iterations  than  that  with  a  solid  con¬ 
ducting  box  of  the  same  size.  Since  is  a 
proportional  factor  in  the  CPU  counter,  to  fur¬ 
ther  reduce  the  total  CPU  time,  it  is  necessary 
to  reduce  the  number  of  iterations  of  the  itera¬ 
tive  solvers.  Hence  preconditioning  techniques, 
which  may  speed  up  the  convergence  rate  of  the 
Krylov  iterative  methods,  are  needed  in  this  ap¬ 
plication. 

We  iteratively  solve  the  linear  system 
of  the  form  Ax  =  6,  where  the  coefficient 
matrix  A  is  a  large  scale,  dense,  and  com¬ 
plex  valued  matrix  for  electrically  large  targets. 
The  biconjugate  gradient  (BiCG)  method  [9], 
the  biconjugate  gradient  stabilized  method 
(BiCGSTAB)  [16],  and  the  transpose-free  vari¬ 
ant  of  the  quasi-minimum  residual  method 
(TFQMR)  [6]  are  some  of  the  well-known  Krylov 
iterative  methods  which  are  applicable  to  non- 
Hermitian  matrices.  In  our  experimental  tests, 
we  use  these  three  methods  as  the  iterative 
solver,  coupled  with  different  preconditioning 
strategies  to  solve  a  few  study  cases  of  repre¬ 
sentative  electromagnetic  scattering  problems. 
We  propose  to  use  an  incomplete  lower-upper 


ACES  JOURNAL,  VOL.  18,  NO.  4,  NOVEMBER  2003 


(ILU)  triangular  factorization  with  a  dual  drop¬ 
ping  strategy  [11,  14]  and  a  sparse  approximate 
inverse  (SAI)  technique  [8,  10,  19]  to  construct  a 
preconditioner  from  the  near  part  matrix  {Bn) 
in  the  MLFMA  implementation.  We  mainly 
compare  the  performance  difference  of  different 
Krylov  iterative  methods  combined  with  differ¬ 
ent  preconditioners. 

2  Preconditioned  Krylov  It¬ 
erative  Methods 

Krylov  iterative  methods  are  considered  to  be 
the  most  effective  iterative  solution  methods 
currently  available  [1,  7,  15].  The  complexity 
of  these  methods  is  on  the  order  of 
if  the  convergence  is  achieved  in  iterations. 
The  Krylov  iterative  methods  such  as  BiCG  re¬ 
quire  the  computation  of  some  matrix-vector 
product  operations  at  each  iteration,  which  ac¬ 
count  for  the  major  computational  cost  of  this 
class  of  methods. 

In  our  experiments,  we  observe  that  the 
convergence  behavior  of  BiCG  is  irregular.  Few 
theoretical  results  are  known  about  the  conver¬ 
gence  of  BiCG  [2].  BiCG  requires  two  matrix- 
vector  products  (one  with  A  and  one  with 
j  the  complex  conjugate  transpose  of  A) 
at  each  step  of  iteration.  BiCGSTAB  is  one 
of  variant  of  BiCG  to  avoid  the  irregular  con¬ 
vergence  patterns  of  BiCG.  A  residual  vector 
is  minimized  locally  and  it  has  substantially 
smoother  convergence  behavior.  Each  iteration 
step  of  BiCGSTAB  also  requires  two  matrix- 
vector  products  (both  with  A).  TFQMR  is  also 
chosen  to  get  a  smoother  convergence  behav¬ 
ior.  TFQMR  requires  two  matrix-vector  prod¬ 
ucts  (both  with  A)  at  each  iteration.  All  of  three 
solvers  designed  to  solve  non-Hermitian  linear 
systems.  Every  related  algorithm  which  is  im¬ 
plemented  in  our  program  originally  comes  from 
[2,  6,  15]. 

The  convergence  behavior  of  the  Krylov 
methods  depends  on  the  distribution  of  the 
eigenvalues  and  on  the  condition  number  of  the 
coefficient  matrix.  By  applying  a  good  precon¬ 
ditioner  we  may  achieve  better  spectrum  and 
smaller  condition  number  compared  to  those 
of  the  original  coefficient  matrix.  Therefore, 
the  convergence  behavior  of  the  Krylov  itera¬ 
tive  method  can  be  improved  by  a  good  pre¬ 


conditioner.  In  our  previous  papers  [10,  11] 
we  show  that  the  ILUT  and  the  SAI  precondi¬ 
tioned  BiCG  method  has  tight  spectrum  around 
1  and  small  condition  numbers  with  little  ex¬ 
tra  CPU  time  to  construct  the  precondition¬ 
ers  and  small  extra  memory  to  store  the  pre¬ 
conditioner  matrix.  For  some  problems,  iter¬ 
ative  methods  without  a  preconditioner  might 
not  converge.  Hence,  preconditioning  techniques 
should  be  used  in  practical  applications  with  the 
Krylov  iterative  method  to  reduce  the  number 
of  iterations.  Most  preconditioning  techniques, 
such  as  the  ILU(O),  rely  on  a  fixed  sparsity  pat¬ 
tern,  obtained  from  the  sparsified  coefficient  ma¬ 
trix  by  dropping  small  magnitude  entries.  Some 
SAI  techniques  need  access  to  the  full  coefficient 
matrix  (to  construct  a  sparsified  matrix),  which 
is  not  available  in  the  FMM. 

The  purpose  of  the  preconditioning  is  to 
make  the  preconditioned  matrix  MA  as  close  to 
the  identity  matrix  I  as  possible.  To  this  end,  we 
try  to  construct  a  matrix  M  that  approximates 
the  matrix  It  is  difficult  to  make  the  ma¬ 
trix  M  sparse,  since  in  most  cases  the  inverse 
of  a  matrix  A  is  dense  even  if  A  is  sparse.  We 
evaluate  three  different  Krylov  iterative  meth¬ 
ods  with  two  preconditioners,  the  ILU  precon¬ 
ditioner  with  a  dual  dropping  strategy  (ILUT) 
(with  a  fill-in  parameter  p  and  a  drop  tolerance 
r)  [11,  14]  and  the  SAI  preconditioner  [10,  19], 
using  the  non-preconditioning  case  as  compari¬ 
son.  In  the  MLFMA  implementation,  the  global 
matrix  A  is  not  numerically  available,  but  the 
near  part  matrix  Bn  is.  We  thus  construct  the 
ILUT  and  the  SAI  preconditioners  with  respect 
to  the  matrix  Bn^  The  total  storage  of  the  ILUT 
preconditioner  is  bounded  by  2pN.  Here  the  pa¬ 
rameter  r  controls  the  computational  cost,  and 
the  parameter  p  controls  the  memory  cost.  We 
use  a  static  sparsity  pattern  strategy  for  con¬ 
structing  the  SAI  preconditioner  M.  For  SAI, 
we  construct  a  sparsified  matrix  Bn  from  the 
near  part  matrix  Bn-  Here  the  matrix  Bn  is 
obtained  from  by  removing  some  small  mag¬ 
nitude  entries  of  Bn  with  respect  to  a  threshold 
parameter  ei.  The  computational  procedure  is 
given  in  [10],  in  which  ei,  62,  and  es  are  three 
user  provided  threshold  drop  tolerance  param¬ 
eters  chosen  by  a  heuristic  process.  By  judi¬ 
ciously  choosing  those  parameters,  we  are  able 
to  construct  both  preconditioners  that  are  effec¬ 
tive  and  do  not  use  much  memory  space.  We 
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use  the  notation  of  ILUT(r,  p)  and  SAI{ei,  e2, 
ea)  for  simplicity,  see  [10,  11]- 

3  Numerical  Experiments 

The  major  cost  of  the  preconditioned  itera¬ 
tive  solvers  is  the  matrix-vector  product  with 
both  the  coefficient  matrix  and  the  precondi¬ 
tioner  [18].  There  are  two  matrix-vector  prod¬ 
ucts  at  each  iteration  of  BiCG,  BiCGSTAB,  and 
TFQMR.  A  number  of  numerical  examples  are 
presented  to  demonstrate  the  performance  of  the 
preconditioned  Krylov  iterative  methods.^  Note 
that  the  examples  are  different  from  the  ones  in 
our  previous  two  papers  [10,  11]  with  different 
incident  angles. 

We  examine  the  convergence  behavior 
based  on  the  number  of  preconditioned  itera¬ 
tions.  We  compare  three  different  Krylov  itera¬ 
tive  methods  (BiCG,  BiCGSTAB,  and  TFQMR) 
with  the  ILUT  preconditioner,  the  SAI  precon¬ 
ditioner,  and  no  preconditioner  one  and  another. 
The  efficiency  of  the  ILU  preconditioner  with  a 
dual  dropping  strategy  (ILUT)  and  the  SAI  pre¬ 
conditioner  (with  BiCG)  is  reported  in  [10,  11]. 
Since  both  ILUT  and  SAI  preconditioners  have 
been  shown  to  be  efficient  for  solving  the  dense 
complex  linear  systems  from  electromagnetic 
wave  scattering  problems,  we  mainly  compare 
the  performance  difference  of  three  Krylov  itera¬ 
tive  methods,  BiCG,  BiCGSTAB,  and  TFQMR. 

We  calculate  the  radar  cross  section  (RCS) 
to  demonstrate  the  performance  of  our  precondi¬ 
tioned  Krylov  iterative  solvers  for  different  con¬ 
ducting  geometries  with  and  without  coating. 
The  geometries  considered  include  plates,  an¬ 
tenna  arrays,  and  cavities  (see  Table  1).  The 
mesh  sizes  for  all  the  test  structures  are  about 
one  tenth  of  a  wavelength. 

The  test  problems  are  described  in  Ta¬ 
ble  1  and  some  numerical  results  are  listed 
in  Table  2.  In  Tables  1  and  2,  “level” 
indicates  the  number  of  levels  in  the  mul¬ 
tilevel  fast  multipole  algorithm,  “setup”  the 
setup  CPU  time  in  seconds  for  constructing 

^  All  Ccises  are  tested  on  one  processor  of  an  HP  Super- 
dome  cluster  at  the  University  of  Kentucky.  The  proces¬ 
sor  has  2  GB  local  memoiy  and  runs  at  750  MHz.  The 
code  is  written  in  Fortran  77  and  is  run  in  single  pre¬ 
cision.  The  computation  terminates  when  the  2-norm 
residual  is  reduced  by  10“^,  or  the  number  of  iterations 
exceeds  2, 000. 


a  preconditioner,  “#it”  the  number  of  the 
(preconditioned)  Krylov  iterations,  and  “to¬ 
tal”  the  CPU  time  in  seconds  for  both  the 
setup  and  the  iteration  phase.  The  notations 
used  for  “case”  are  0=BiCG,  l=BiCGSTAB, 
2=TFQMR,  N=NONE,  I^ILUT,  and  S=SAI. 
Thus,  “P3C2S”  means  that  the  P3C  case  is 
solved  by  using  TFQMR  with  the  SAI  precon¬ 
ditioner. 

Due  to  space  limit,  we  report  one  set  of 
parameters  for  the  ILUT  and  the  SAI  precondi¬ 
tioners.  In  our  experiments,  we  use  ILUT(10“^, 
30)  for  the  PIC  and  P3C  cases,  ILUT(10"*^,  130) 
for  the  CIC  case,  and  SAI(0.03, 0.04, 0.05)  for  all 
test  cases. 

According  to  the  total  CPU  time  for  solv¬ 
ing  a  problem,  BiCGSTAB  with  the  SAI  and  the 
ILUT  preconditioners  seems  to  converge  very 
fast  for  most  cases  (see  Table  2).  Without  a 
preconditioner,  the  results  are  various  depend¬ 
ing  on  cases.  For  the  problem  which  has  a  small 
number  of  unknowns  (say,  less  than  1000),  the  it¬ 
erative  solver  with  the  SAI  preconditioner  takes 
more  time  than  with  the  ILUT  preconditioner, 
mainly  due  to  the  higher  cost  in  constructing  the 
SAI  preconditioner. 

Figures  1-3  show  the  number  of  itera¬ 
tions  of  (a)  BiCG  with  three  different  precon¬ 
ditioners,  (b)  BiCGSTAB  with  three  different 
preconditioners,  (c)  TFQMR  with  three  differ¬ 
ent  preconditioners,  (d)  NONE-preconditioned 
three  different  solvers,  (e)  ILUT  preconditioned 
three  different  solvers,  and  (f)  SAI  precondi¬ 
tioned  three  different  solvers. 

In  the  PIC  case,  we  observe  that  all  three 
iterative  solvers  with  the  SAI  preconditioner 
converge  faster  than  the  other  two,  and  with¬ 
out  a  preconditioner  are  the  slowest.  Without 
a  preconditioner,  BiCG  is  the  fastest  one  and 
TFQMR  is  the  slowest.  With  the  ILUT  pre¬ 
conditioner,  BiCGSTAB  is  the  fastest  one  and 
BiCG  is  the  slowest.  With  the  SAI  precondi¬ 
tioner,  BiCGSTAB  is  the  fastest  one  and  the 
BiCG  is  the  slowest. 

In  the  ClC  case,  we  see  that  BiCG  and 
BiCGSTAB  iterative  solvers  with  the  SAI  pre¬ 
conditioner  converge  faster  than  the  other  two, 
and  without  a  preconditioner  are  the  slowest. 
TFQMR  with  the  ILUT  preconditioner  is  the 
fastest  one  and  that  without  preconditioner  are 
the  slowest.  Without  a  preconditioner,  TFQMR 
is  the  fastest  one  and  BiCG  is  the  slowest.  With 


58 


ACES  JOURNAL,  VOL.  18,  NO.  4,  NOVEMBER  2003 


Table  1:  Information  about  the  matrices  used  in  the  experiments  (all  length  units  are  in  Ao,  the 
wavelength  in  free-space) . 


case 

level 

unknowns 

matrix 

nonzeros 

target  size  and  description 

PIC 

4 

1,416 

A 

2,005,056 

Dielectric  plate  over  conducting  plate 
2.98824  X  2  X  0.1 
frequency  =  200MHz 

66,384 

Bpj 

155,616 

CIC 

5 

20,176 

A 

407,070,976 

Cavity 

3  X  1  X  0.5 
frequency  =  iGHz 

1,565,032 

Bn 

3,728,842 

P3C 

7 

100,800 

A 

10,160,640,000 

Patch  array  (30  x  30) 

Ad 

3,571,808 

Array  size:  22.25  x  22.25 

Bn 

7,211,632 

frequency  =  300MHz 

Table  2:  Numerical  results  with  different  test  cases. 


case 

setup 

#it 

total 

case 

setup 

#it 

total 

case 

setup 

#it 

total 

PICON 

PlClN 

P1C2N 

PICOI 

PlClI 

P1C2I 

PICOS 

PICIS 

P1C2S 

- 

973 

1044 

2000 

112.3 

118.4 
225.9 

Cl  CON 
CICIN 
C1C2N 

- 

812 

751 

509 

1431.1 

1310.3 

893.2 

P3C0N 

P3C1N 

P3C2N 

- 

347 

201 

216 

2499.9 

1450.9 
1529.1 

0.3 

40 

24 

30 

5.4 

3.3 

4.3 

CICOI 

CICII 

C1C2I 

67.5 

367 

112 

179 

1028.4 

327.6 

577.5 

P3C0I 

P3C1I 

P3C2I 

P3C0S 

P3C1S 

P3C2S 

111.5 

37 

12 

20 

414.5 

205.7 

275.9 

8.0 

29 

15 

17 

11.5 

9.7 

10.1 

CICOS 

CICIS 

C1C2S 

110.9 

322 

120 

199 

714.9 

326.2 

476.4 

64.2 

41 

17 

26 

376.0 

189.6 

254.7 

the  ILUT  preconditioner,  BiCGSTAB  is  the 
fastest  one  and  BiCG  is  the  slowest.  With  the 
SAI  preconditioner,  BiCGSTAB  is  the  fastest 
one  and  the  BiCG  is  the  slowest. 

In  the  P3C  case,  we  find  that  all  three 
iterative  solvers  with  the  ILUT  preconditioner 
converge  faster  than  the  other  two,  and  with¬ 
out  a  preconditioner  are  the  slowest.  With  all 
three  different  preconditioners,  BiCGSTAB  is 
the  fastest  one  and  BiCG  is  the  slowest. 

Although  a  general  iterative  solver  for 
solving  some  categories  of  problems  efficiently 
might  not  exist  [13],  according  to  the  results 
from  our  numerical  experiments,  we  can  see  that 
BiCGSTAB  with  the  SAI  or  the  ILUT  precon¬ 
ditioners  is  robust  and  converges  very  fast  for 
solving  three  dimensional  model  cases  from  elec¬ 
tromagnetic  scattering  simulations.  In  all  cases, 
these  Krylov  iterative  methods  without  a  pre¬ 
conditioner  are  much  less  efficient. 

4  Conclusions 

We  conducted  a  few  numerical  tests  to  show  that 
the  Krylov  iterative  methods  coupled  with  the 


ILUT  and  the  SAI  preconditioners  are  efficient 
to  solve  the  problems  arising  from  electromag¬ 
netic  scattering. 

Our  numerical  results  indicate  that,  solv¬ 
ing  the  large  non-Hermitian  dense  linear  sys¬ 
tem  arising  from  the  electromagnetic  scatter¬ 
ing  by  using  the  BiCGSTAB  method  with  the 
ILUT  preconditioner  and  the  SAI  preconditioner 
achieves  faster  convergence  in  most  cases.  The 
ILUT  and  the  SAI  preconditioned  Krylov  itera¬ 
tive  solvers  (BiCG,  BiCGSTAB,  and  TFQMR) 
maintain  the  computational  complexity  of  the 
MLFMA,  and  consequently  reduces  the  total 
CPU  time.  Our  experimental  experience  may 
help  researchers  and  engineers  choose  suitable 
robust  solvers  in  practical  large  scale  electro¬ 
magnetic  simulations. 
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Abstract —  In  this  paper  a  method  for  the  solu¬ 
tion  of  scattering  problems  is  proposed.  In  partic¬ 
ular  the  EFIE  is  solved  by  a  collocation  point  pro¬ 
cedure,  where  the  unknown  current  density  is  ex¬ 
panded  by  Daubechies  wavelets  on  the  interval  and 
the  integration  is  performed  in  the  wavelet  domain 
by  the  integral  operator,  hence  without  the  use  of 
any  quadrature  formula.  Comparison  with  induced 
currents  calculated  by  a  standard  MoM  and  with 
fields  calculated  by  a  FEM  code  are  reported. 


I.  Introduction 

Wavelet  Expansion  (WE)  has  become  a  widely 
used  tool  in  electromagnetic  analysis.  The  main 
reasons  can  be  found  in  the  strong  interpolating 
properties  of  the  wavelet  functions  and  to  the  fact 
that  WE  and  reconstruction  can  be  performed  by 
the  use  of  fast  algorithms  (sec  for  example  [1]  -  [3]). 

The  analysis  of  scattering  problems  can  be  car¬ 
ried  out  expanding  the  unknown  functions  in  terms 
of  a  chosen  wavelet  basis  and  performing  a  Galcrkin 
procedure  using  the  same  wavelet  functions  as  test 
functions  (sec  [4]  -  [6]).  The  integration  is  then 
performed  by  the  use  of  quadrature  formulae. 

In  this  paper  the  EFIE  for  the  scattering  on  a 
conductive  body  is  solved  in  a  different  way.  By  uti¬ 
lizing  Daubechies  wavelets  on  the  interval  the  un¬ 
known  current  is  expanded,  then  a  collocation  point 
method  is  used  and  the  integration  is  performed 
by  the  use  of  the  integral  operator  for  Daubechies 
wavelets  on  the  interval  developed  by  the  authors 
(see  [8]).  In  this  way  the  need  of  quadrature  formu¬ 
lae  is  avoided;  furthermore  the  well  known  capabil¬ 
ity  of  the  wavelet  functions  of  representing  irregular 
signals  with  few  coefficients,  allows  us  to  use  bases 
of  low  dimension  (if  compared  with  the  number  of 
unknowns  of  a  standard  MoM);  for  these  reasons 
both  accuracy  and  CPU  time  saving  arc  achieved. 


II.  Wavelets  on  the  Interval  and 
Operators 

The  concepts  of  scaling  functions,  wavelets,  time- 
scale  analysis,  multiresolution  analysis  arc  here 
considered  known  [I];  there  are  many  wavelet 
bases  available  in  the  literature,  and  we  chose  the 
Daubechies  Wavelets  on  the  interval  [2]  for  their 
numerical  properties.  In  particular  the  choice  of 
wavelets  that  "survive”  only  on  intervals  is  adopted 
because  we  arc  interested  in  the  solution  of  a  bound¬ 
ary  value  problem. 

From  the  wavelet  theory  we  know  that  WE  must 
be  performed  starting  from  a  signal  known  at  a 
dyadic  number  of  samples.  This  number  is  equal  to 
the  dimension  of  the  basis  on  which  we  perform  the 
WE;  hence  a  signal  represented  by  n  =  2^  sam¬ 
ples  when  expanded  in  the  wavelet  domain  leads 
to  a  number  of  2^  coefficients.  For  the  com¬ 
pact  support  wavelet  there  is  an  important  rela¬ 
tion  that  allows  a  straight  computation  of  the  co¬ 
efficients  at  the  higher  resolution  of  a  generic  func¬ 
tion:  it  is  possible  to  obtain  them  from  the  sam¬ 
ples  of  the  functions  itself  according  to  the  relation 
{<t>J,k,f)  =  2'^/^/(2'^A:),  where  is  the  scaling 
function  of  order  J,  k  of  the  adopted  wavelet  basis. 
Then,  the  vector  of  wavelet  coefficients  g  represent¬ 
ing  the  wavelet  transform  of  a  function  g{x)  can  be 
obtained  by  multiplying  a  matrix  W  related  to  the 
adopted  wavelet  basis  and  the  samples  g{xj)  corre¬ 
sponding  to  2"^  equally  spaced  points  in  the  inter¬ 
val  [0,1].  Further  details  about  wavelet  numerical 
computation  can  be  found  in  [3]. 

When  performing  WE  of  a  function,  the  nota¬ 
tion  that  will  be  used  throughout  the  paper  is  the 
following: 

i 

where  b(a;)  =  [&i(a:),  •  •  •  &„(a;)]  is  the  wavelet  basis 
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and  f  =  [/i,  •  *  •  >  /n]^  is  the  vector  of  coefficients 
constituting  the  wavelet  expansion  of  the  signal. 

Also  operators  can  be  represented  in  the  wavelet 
domain,  as  described  in  [7].  The  authors  have  ob¬ 
tained  the  representation  of  the  integral  operator 
for  the  Daubechies  wavelets  on  the  interval  (see  for 
example  [8]).  The  convenience  in  using  operators  in 
the  wavelet  domain  is  that  the  integral  of  a  function 
f{x)  can  be  calculated  by  the  matrix- vector  prod¬ 
uct  Tf  where  T  is  the  constant  sparse  matrix  rep¬ 
resenting  the  operator  in  the  wavelet  domain  and 
f  is  the  wavelet  expansion  of  the  function  f{x).  In 
the  previous  operation  the  result  is  the  primitive 
function  of  /.  In  order  to  better  understand  the 
previous  statement  let  us  define  f{x)  =  we  can 
write  that 

f  f{x)dx  =  f  x^dx  =  ”X^  =  ^(x)  (2) 

[0.1]  [0,1] 

where  the  integral  limit  [0, 1]  indicates  that  we  are 
considering  the  interval  [0,1].  Expanding  (2)  ac¬ 
cording  to  (1)  in  the  wavelet  domain  we  can  write 
that 

f  b(x)fdx  =  b(x)g  (3) 

[0,1] 

where  f  and  g  are  the  wavelet  expansion  of  the  two 
functions.  Left  multiplying  by  b(x)^  and  taking 
into  account  the  definition  of  the  integral  operator 

(  T  =  (b(x),y' b(x)))  and  the  orthogonality  prop¬ 
erties  of  the  wavelet  basis  (Id  =  (b(x),  b(x)),  whith 
Id  being  the  identity  matrix)  we  can  write  that 

Tf  =  g  (4) 

Hence  as  clearly  stated  in  equation  (4)  the  cal¬ 
culation  of  function  ^(x)  can  be  performed  in  the 
wavelet  domain  by  multiplying  matrix  T  by  vector 
f,  the  wavelet  expansion  of  the  function  /(x);  then 
the  result,  vector  g,  is  inverse  transformed,  obtain¬ 
ing  the  function  g{x). 

In  order  to  compute  a  definite  integration  directly 
in  the  wavelet  domain,  two  new  (row)  vectors  must 
be  introduced:  vectors  bo  =  b(0)  and  bi  =:  b(l) 
that  are  the  values  of  the  basis  functions  on  the  left 
and  right  border,  respectively,  of  the  interval  [0,  Ij. 
The  meaning  of  these  two  vectors  is  the  following: 
given  a  function  /(x),  its  left  border  value  /(O)  can 
be  obtained  by  the  coefficients  of  its  WE  simply  by 
the  use  of  the  vector  bo  as  /(O)  =  bof. 


Suppose  that  we  want  to  compute 
1  1 

J  f{x)dx  =  j  x^dx  =  ^  (5) 

0  0 

Then  starting  from  equation  (4)  it  must  be  con¬ 
sidered  that  the  primitive  function  (g)  is  already 
calculated,  and  we  only  need  to  evaluate  it  on  the 
borders  of  the  interval.  Hence  it  yields 

1 

J  f{x)dx  =  big  -  bog  (6) 

0 

which  can  be  rewritten  as 
1 

J  f{x)dx  =  biTf  -  boTf  (7) 

0 

As  evidenced  in  equation  (7)  the  calculation  of 
the  definite  integral  of  a  function  /(x)  can  be  per¬ 
formed  knowing  its  WE  f ,  the  integral  operator  ma¬ 
trix  T  and  the  border  vectors.  The  quantities  T, 
bo  and  bi  are  known  once  a  wavelet  basis  has  been 
chosen,  so  they  need  to  be  computed  only  once,  and 
not  at  any  analysis. 


III.  Method  of  Moments  and  Wavelet 
Expansion 

A.  General  Considerations 

In  the  study  of  scattering  from  conducting  cylin¬ 
ders,  an  integral  equation  can  be  formulated,  which 
in  general  has  the  form  of 


A 


■J  J  Js(r')G(rs,r')ds'+ 


+v//  V'- Js(r')G(rs,r')ds' 


=  E‘(r  =  r,) 


(8) 

where  77  —  y/ and  Jg  is  the  cur¬ 

rent  density  induced  on  the  scatterer,  G  is  the  green 
function  for  the  three  dimensional  scatterer,  r'  and 
Fg  are  respectively  the  integration  variable  and  the 
observation  point,  both  on  the  surface  scatterer; 
and  E[  is  the  incident  field. 

In  a  simpler  way  and  in  one  dimension  equation 
(8)  can  be  in  general  rewritten  as 


/ 


g{x^)K{x,  x')dx'  -1-  c{x)g{x)  =  h{x)  (9) 


where  ^^(x)  is  our  unknown  function.  In  the  lit¬ 
erature  [5],  function  g  is  expanded  in  the  wavelet 
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domain,  and  the  expansion  is  substituted  in  (9).  In 
order  to  obtain  a  linear  system  for  the  unknown 
coefficients,  the  resultant  equation  is  tested  with 
the  same  expansion  functions  (Galerkin’s  method). 
Quadrature  formulae  available  in  the  literature  for 
wavelets  and  scaling  functions  are  used,  and  a 
square  system  is  obtained,  which  solved  gives  the 
unknown  coefficients. 

Due  to  the  high  interpolating  properties  of  the 
Daubechies  wavelets  on  the  interval,  we  have 
adopted  this  family  of  wavelets,  for  which  the  au¬ 
thors  have  developed  the  representation  of  the  inte¬ 
gral  operator.  Wavelets  on  the  interval  (and  so  op¬ 
erators  in  the  wavelet  domain)  are  defined  on  [0, 1], 
hence  the  contour  of  the  scattercr  must  be  mapped 
into  the  interval  [0, 1].  As  it  is  suggested  in  [6]  for 
an  arbitrary  contour  of  the  scattercr  two  steps  must 
be  performed: 

•  The  contour  of  the  scattercr  is  discretized  in 
boundary  elements  and  then  each  boundary  ele¬ 
ment  (simply  a  first  order  element)  is  mapped  into 
one  dimensional  standard  element  through  shape 
functions  or  interpolation  functions. 

•  The  standard  elements  are  mapped  into  corre¬ 
sponding  portions  of  the  interval  [0, 1]. 

In  this  way  the  basis  functions  are  defined  in  a 
standard  way  on  the  interval  [0, 1],  since  the  contour 
has  been  mapped  on  this  interval. 

B.  Scattering  from  a  Conductive  Body 

In  case  of  a  two  dimensional  problem  with  TM^ 
polarization  the  EFIE  equation  is  the  following: 

^  -  p'\)dc'  =  Eiipm)  (10) 

4  Jc 

where  pm.  is  any  observation  point  on  the  scattercr, 
p'  is  any  source  point  on  the  scattercr  and  C  is  the 
contour  of  the  scattercr. 

After  the  mapping  on  the  elemental  interval  [0, 1] 
(described  in  the  previous  section)  is  performed,  we 
perform  a  classical  collocation  point  procedure:  we 
evaluate  equation  (10)  at  a  particular  point  pm  of 
the  contour;  hence  equation  (10)  can  be  rewritten 
as 

1 

k  j  f{x)g{x)dx  “  h  (11) 

b 

where  k  =  f{x)  =  HQ^\p\pm  “  p’\)  g{x)  = 

Then  the  wavelet  expansion  in  the  space  domain 
is  performed.  It  has  to  be  noticed  that  in  equation 


(II)  there  is  the  product  between  two  functions,  in 
particular  one  of  them  is  known  {f{x))  while  the 
other  is  our  unknown.  As  explained  in  [8]  it  is 
possible  to  obtain  the  wavelet  expansion  of  a  the 
product  between  two  functions  y{x)  =  f[x)g[x)  as 
the  product  between  a  constant  diagonal  matrix  F 
and  a  vector  g,  where  g  is  the  wavelet  expansion  of 
the  function  g{x)  and  F  is  a  diagonal  matrix  whose 
entries  arc  the  samples  at  n  =  2^  equally  spaced 
points  of  the  values  of  f[x)  in  the  interval.  This 
approximation  is  as  much  accurate  as  the  number 
of  samples  is  high,  hence  as  the  resolution  of  the 
chosen  basis  increases.  In  case  the  two  functions 
are  know  the  above  described  procedure  is  useless, 
since  the  expansion  of  y[x)  can  be  performed  di¬ 
rectly.  But  in  the  cases  when  one  of  the  two  func¬ 
tions  is  unknown  then  the  procedure  is  fundamen¬ 
tal,  since  it  allows  to  keep  g  as  the  unknown  vector 
and  anyway  perform  the  wavelet  expansion.  Hence 
equation  (11)  can  be  expressed  in  the  wavelet  do¬ 
main  as 

1 

k  J  h{x)Fgdx  =  b(x)h  (12) 

0 

where  F  is  the  diagonal  matrix  with  the  samples 
of  the  Hankcl  function,  g  is  the  vector  of  unknown 
coefficients  and  h  is  the  expansion  of  constant  h 
on  the  interval  [0, 1]  i.e,  considered  as  a  constant 
function  on  the  whole  interval.  By  left  multiplying 
equation  (12)  by  h{x)'^  we  obtain 

1 

kh{x)'^  J  h{x)dxFg  =  b(a:)^b(a:)h  (13) 

0 

and  taking  into  account  the  definition  of  the  inte¬ 
gral  operator  and  the  orthogonality  properties  of 
the  chosen  basis  we  obtain 

/cTFg  -  h  (14) 

In  this  way  equation  (14)  establishes  a  relation 
between  the  primitive  of  the  product  fix)g{x)  and 
the  constant  function  h{x)  =  constant.  This  is 
actually  something  different  from  what  we  want, 
hence  the  two  vectors  bo  and  bi  (introduced  in  sec¬ 
tion  II)  must  be  employed.  Hence  we  can  write  that 

biibTFg  -  bo/cTFg  =  h  (15) 

Equation  (15)  is  characterized  by  known  matri¬ 
ces  T  and  F  and  known  vectors  boand  bjand  by 
the  unknown  vector  of  coefficients  g.  Once  the  res¬ 
olution  of  the  wavelet  basis  is  chosen  the  number  of 
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basis  functions  is  consequently  fixed,  hence  equa¬ 
tion  (15)  must  be  written  for  a  number  of  n  points 
on  the  interval  itself.  This  leads  to  a  sparse  linear 
square  system  whose  unknowns  are  the  coefficients 
g  and  which  can  be  solved  in  low  CPU  time. 

IV.  Numerical  Results 

The  numerical  results  presented  here  are  relative 
to  the  scattering  of  a  square  conductive  object  illu¬ 
minated  by  a  polarized  TMz  field.  The  geometry 
of  the  system  and  the  input  signal  are  reported  in 
figures  1  and  2. 

A  y 


Fig.  1.  Geometry  of  the  system 


calculated  current  density  on  the  surface  scatterer 
for  a  number  of  64,  128  and  256  wavelet  functions 
for  a  frequency  of  1  GHz. 


Fig.  3.  Current  density  on  the  scatterer  calculated  by  the 
use  of  64  wavelets 

It  is  evident  that  together  with  the  increase  of  the 
resolution  from  64  to  256,  the  accuracy  of  the  re¬ 
sults  becomingh  higher.  Nevertheless  even  at  lower 
resolutions  the  obtained  current  (at  a  very  lo  CPU 
time  cost)  is  consistent  witht  the  problem. 


Fig.  4.  Current  density  on  the  scatterer  calculated  by  the 
use  of  128  wavelets 


Fig.  2.  Input  signal 

The  diagonal  of  the  scatterer  is  of  0.2m  and  the 
frequency  content  of  the  input  signal  is  of  the  order 
of  GHz.  FFT  has  been  used  in  order  to  obtain  the 
behavior  in  the  time  domain.  Figures  3-5  show  the 


Figure  6  shows  the  comparison  between  the  cal¬ 
culations  performed  by  the  proposed  method  with 
a  resolution  of  256  wavelets  and  a  standard  MoM 
(with  collocation  point)  technique  with  500  points 
on  the  whole  perimeter.  It  can  be  seen  the  very 
good  agreement  between  the  two  different  methods. 


66 


ACES  JOURNAL,  VOL.  18,  NO.  4,  NOVEMBER  2003 


Fig.  5.  Current  density  on  the  scatterer  calculated  by  the 
use  of  256 


Fig.  6.  Current  density  on  the  scatterer  calculated  by  the 
use  wavelets  and  MoM 

As  for  the  CPU  time  the  method  proposed  is 
of  the  order  of  2-4  times  faster  than  the  standard 
MoM,  due  to  the  high  numerical  efficiency  of  the 
wavelet  expansion  and  integration,  as  described  in 
the  previous  sections:  the  construction  of  the  inte¬ 
gral  operator  matrix  and  the  border  vectors  is  done 
only  once  and  can  be  seen  as  a  pre  processing  activ¬ 
ity,  while  the  integration  is  performed  by  a  simple 
matrix  -  vector  product,  without  the  need  of  any 
quadrature  formula.  Furthermore  the  well  known 
numerical  properties  of  the  wavelet  functions  (well 
addressed  in  the  literature)  allow  the  choice  of  bases 
of  small  dimensions  in  order  to  obtain  accurate  re¬ 
sults. 

Since  the  calculation  of  the  current  density  is  not 


the  only  important  result  from  an  engineering  poin 
of  view,  a  comparison  with  results  obtained  by  the 
use  of  a  FEM  code  of  the  total  field  are  also  re¬ 
ported.  In  figures  7  and  8  the  electric  field  in  point 
P  =  (-0.2,0)  and  P  =  (0, -0.2)  are  reported, 
evaluated  by  the  FEM  code  and  by  the  proposed 
method,  by  using  a  wavelet  basis  of  256  functions. 


Fig.  7.  Electric  field  evaluated  at  point  P  =  (—0.2,0) 


Fig.  8.  Electric  field  evaluated  at  point  P  =  (0,  —0.2) 


Figure  9  reports  the  comparison  between  two  so¬ 
lutions  obtained  by  the  proposed  method  at  dif¬ 
ferent  resolutions,  in  particular  with  64  and  256 
wavelets,  and  shows  the  robustness  of  the  method 
in  terms  of  calculated  fields 
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Fig.  9.  Comparison  at  different  resolutions  of  Electric  field 
evaluated  at  point  P  =  (0,  —0.2) 


V.  Conclusion 

The  method  proposed  here  performs  the  analysis 
of  scattering  problems  by  the  use  of  a  MoM  tech¬ 
nique  numerically  implemented  by  wavelet  expan¬ 
sion.  In  particular  the  use  of  the  integral  operator 
in  the  wavelet  domain  makes  it  possible  to  obtain 
the  unknown  current  without  the  use  of  quadra¬ 
ture  formulae,  and  the  high  interpolating  proper¬ 
ties  of  the  chosen  wavelet  basis  gives  results  with 
low  CPU  time.  The  good  quality  of  the  results  is 
demost rated  by  comparisons  with  standard  MoM 
and  FEM  computations. 
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Abstract:  An  alternative  method  for  generating 
higher  dimensional  wavelet-like  basis  functions 
is  proposed  in  this  paper.  One  method  that  has 
been  used  was  to  derive  the  two-dimensional 
wavelet-like  basis  from  the  two-dimensional 
traditional  finite  element  basis.  However,  in  this 
paper,  products  of  one-dimensional  wavelet-like 
functions  are  used  as  two-dimensional  wavelet¬ 
like  basis  functions.  The  generation  of  linear 
wavelet-like  funetions  is  discussed  in  detail  and 
the  use  of  linear  and  higher  order  wavelet-like 
functions  is  also  investigated.  The  advantages 
and  disadvantages  of  this  teehnique  for  deriving 
wavelet-like  basis  functions  will  be  discussed. 

Keywords:  Wavelets,  Iterative  Techniques, 

Finite  Element  Methods 

I.  INTRODUCTION 

Wavelets  and  wavelet  analysis  have  recently 
become  increasingly  important  in  the 
computational  sciences.  Wavelets  have  many 
applications  in  areas  such  as  signal  analysis, 
image  compression,  and  the  numerical  solution 
of  partial  differential  equations  and  integral 
equations.  Only  rather  recently,  however,  have 
wavelets  begun  being  used  in  computational 
electromagnetics.  The  multiresolution  time 
domain  technique  (MRTD),  developed  by  Katehi 
et.  al,  has  attracted  abundant  interest  in  the  use  of 
wavelets  as  basis  funetions  [1].  Gordon  has  used 
wavelet-like  basis  functions  in  the  numerieal 
solution  of  elliptic  partial  differential  equations 


[2].  The  wavelet-like  functions  have  also  been 
used  as  the  basis  for  a  finite  element  time- 
domain  algorithm  [3].  Although  the  wavelet-like 
functions  are  not  true  wavelets,  they  do  exhibit 
some  of  the  benefits  that  have  caused  wavelets  to 
receive  attention.  One  advantage  that  will  be 
discussed  in  detail  in  this  paper  is  that  wavelet¬ 
like  basis  functions  have  good  stability  and 
convergence  properties. 

II.  GENERATION  OF  BASIS  FUNCTIONS 

The  method  for  generating  the  wavelet-like 
basis  was  first  discussed  by  Jaffard  in  [4]. 
Consider  the  generation  of  linear  wavelet-like 
functions  for  which  the  domain,  Q,  is  the  line 
segment  0<x<l.  Assume  that  the  problem 
under  eonsideration  has  Dirichlet  boundary 
conditions  so  that  the  value  of  the  solution  is 
specified  at  both  endpoints  of  the  problem 
domain.  This  eliminates  the  necessity  of  nodes 
at  the  endpoints  of  the  domain.  To  begin  the 
multiresolution  analysis  (MRA)  for  the 
generation  of  the  linear  wavelet-like  basis 
functions,  an  initial  discretization  is  chosen  such 
that  there  is  a  single  node  placed  at  the  midpoint 
of  the  domain  (Fig.  1).  This  corresponds  to 
beginning  with  two  segments  in  the  initial 
discretization;  one  segment  fromO  <  x  <  0.5 ,  and 
another  segment  from  0.5  <  x  <  1 .  This  particular 
discretization  is  not  a  requirement;  a  very  simple 
or  extremely  complex  segmentation  may  be  used 
as  the  initial  discretization  of  the  problem 
domain. 
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I  ■  Beginning  mesh  with  a  single  node  | 

- 1 — 1 - 1 - 1 - •- 1 - 1 - 1 — ^ - 

0.000  0.125  0.250  0.375  0.500  0.625  0.750  0.875  1.000 

Fig.  1 .  Initial  discretization  with  a  single  node  in  the  mesh. 

This  beginning  discretization  is  chosen  for  ease 
of  illustration  of  the  MRA.  The  traditional  linear 
basis  function  associated  with  the  node 
atx  =  0.5  is  normalized  (Fig.  2). 


Fig.  2.  Wavelet-like  function  from  the  first  level  of  the 
MRA. 

The  first  level  of  the  MRA  has  now  been 
completed  and  this  function  is  considered  to  be 
the  first  wavelet-like  basis  function.  To  begin 
the  second  level  of  the  MRA,  each  of  the  two 
segments  from  the  first  level  is  divided  into  two 
equal  segments.  After  doing  this,  there  are  now 
four  segments  in  the  domain;  one  segment  from 
0<a:<0.25,  another  segment  from 
0.25  <  jc  <  0.5 ,  another  from  0.5  <  ;c  <  0.75 ,  and 
another  from  0.75<a:<1.0.  The  node  at 
jic  =  0.5  is  not  a  new  node,  and  the  function 
associated  with  it  is  discarded.  The  nodes 
located  at.x  =  0.25  and  atx  =  0.75  are  new  nodes 
(Fig.  3).  Therefore,  these  two  nodes  need  to  be 
considered  in  the  analysis.  Next,  the  traditional 
basis  functions  that  are  associated  with  the  two 
nodes  are  orthogonalized  against  the  wavelet-like 
function  from  the  first  level;  then,  the  resulting 


functions  will  be  orthonormalized  against  each 
other. 


■  Single  Node  at  the  Beginning 
▲  Nodes  Added  at  the  Second  Level 


- ^ - 1 - ^ - 1 - IP - , - ^ - 1 - , - - 

0.000  0.125  0.250  0.375  0.500  0.625  0.750  0.875  1.000 

Fig.  3.  Initial  discretization  with  added  nodes  from  second 
level  of  the  MRA. 

After  the  orthonormalization,  the  two  functions 
can  be  added  to  the  wavelet-like  basis.  Now  the 
second  level  of  the  MRA  is  complete  and  there 
are  three  wavelet-like  functions  in  the  basis.  The 
wavelet-like  function  associated  with  the  node  at 
X  =  0.25  is  shown  in  Fig.  4. 


Fig.  4.  Wavelet-like  function  added  at  the  second  level  of 
the  MRA. 

The  process  of  subdividing  the  segments  and 
orthogonalizing  traditional  basis  functions 
against  previous  wavelet-like  basis  functions  and 
then  orthonormalizing  the  resulting  functions  can 
be  continued  until  the  desired  level  of 
discretization  is  reached.  Figures  5,  6,  and  7 
show  the  progression  of  the  subdivision  of  the 
line  segment  from  the  third  level  to  the  fifth  level 
of  the  analysis.  Also,  the  linear  wavelet-like 
basis  function  associated  with  the  node  at 
x  =  0.375,  which  was  added  during  the  third 
level  of  the  MRA,  is  shown  in  Fig.  8. 
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0.000  0.125  0.250  0.375  0.500  0.625  0.750  0.875  1.000 


Fig.  5.  Second  Level  discretization  with  added  nodes  from 
the  third  level. 


0.000  0.125  0.250  0.375  0.500  0.625  0.750  0.875  1.000 


Fig,  6.  Third  level  discretization  with  added  nodes  from 
the  fourth  level. 


0.000  0.125  0.250  0.375  0.500  0.625  0.750  0.875  1.000 


Fig.  7.  Fourth  level  discretization  with  added  nodes  from 
the  fifth  level. 


0.0  0.2  0.4  0.6  0.8  1.0 


Fig.  8.  Wavelet-like  function  added  at  the  third  level  of 
the  MRA. 

This  concludes  the  discussion  of  the  generation 
of  one-dimensional  linear  wavelet-like  functions. 
Now  there  will  be  a  brief  diseussion  of  the 
generation  of  higher  dimensional  and  higher 
order  wavelet-like  functions. 


There  are  two  methods  that  have  been  used  to 
generate  higher  dimensional  wavelet-like 
functions.  One  possibility  is  to  generate  them 
from  their  higher  dimensional  traditional  finite 
element  counterparts.  For  example,  a  piecewise 
linear  two-dimensional  wavelet-like  basis  can  be 
generated  from  the  traditional  two-dimensional 
tetrahedral  basis.  However,  this  is  not  how 
higher  dimensional  wavelets  are  typieally 
ereated.  Instead,  they  are  generally  formed  from 
products  of  one-dimensional  wavelets  [5].  In 
two  dimensions,  this  yields 

Hutchcraft  and  Gordon  have  shown  that  this 
technique  can  also  be  employed  using  products 
of  wavelet-like  functions  [6]. 

Just  as  higher  dimensional  wavelet-like 
functions  can  be  generated  using  their  traditional 
counterparts,  so  can  higher  order  wavelet-like 
funetions.  These  funetions  have  been  used  by 
Hutchcraft  and  Gordon  in  the  numerical  solution 
of  a  one-dimensional  problem  in  [7]  in  which  the 
traditional  piecewise  cubic  basis  functions  are 
used  to  generate  piecewise  cubic  wavelet-like 
basis  funetions.  Implementing  both  of  these 
concepts,  higher  order,  higher-dimensional 
wavelet-like  functions  can  be  generated  by 
forming  products  of  one-dimensional  higher 
order  wavelet-like  functions. 

III.  EXAMPLES  OF  ONE  AND  TWO- 

DIMENSIONAL  BASIS  FUNCTIONS 

Consider  a  rectangular  region  as  the  domain 
for  a  two-dimensional  problem.  To  obtain  a  two- 
dimensional  wavelet-like  basis,  one-dimensional 
wavelet-like  functions  need  to  be  generated  in 
both  the  X-  and  y-directions  by  the  method 
outlined  previously.  For  the  two-dimensional 
wavelet-like  basis,  all  produets  of  a  wavelet-like 
function  in  the  x-direction  with  a  wavelet-like 
function  in  the  y-direction  will  be  considered  a 
two-dimensional  wavelet-like  basis  function; 
thus,  the  total  number  of  wavelet-like  functions 
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generated  by  this  procedure  will  be  the  total 
number  of  wavelet-like  functions  in  the  x- 
direction  multiplied  by  the  total  number  of 
wavelet-like  functions  in  the  y-direction. 

To  aid  in  the  visualization  of  these  functions, 
Figs.  9-16  show  several  one-  and  two- 
dimensional  linear  and  cubic  wavelet-like 
functions.  First,  Figs.  9  and  10  illustrate  one¬ 
dimensional  cubic  wavelet-like  functions.  In 
Fig.  10,  the  more  slowly  varying  of  the  two 
functions  is  from  the  first  level  of  the  MRA.  It 
has  a  single  piecewise  cubic  representation  over 
the  entire  domain.  The  other  function  in  Fig.  10 
is  from  the  second  level  in  the  MRA.  It  is  also 
piecewise  cubic,  but  it  has  two  different 
representations;  one  representation  for  the 
segment  0  <  x  <  0.5  and  another  representation  for 
the  segment  fromO.5  <  x  <  1 .0 . 


Fig.  9.  Third  order  wavelet-like  basis  functions  from  the 
and  2"“*  levels. 

As  discussed  previously,  two-dimensional 
wavelet-like  functions  are  obtained  by  forming 
products  of  one-dimensional  wavelet-like 
functions.  Figure  11  shows  a  two-dimensional 
linear  wavelet-like  function.  The  linear  wavelet¬ 
like  function  B6(x,y),  which  could  also  be  written 
as  B2(x)B3(y)  to  denote  that  it  is  derived  from  the 
product  of  the  2"'*  basis  function  in  the  x- 
direction  and  the  basis  function  in  the  y- 
direction,  is  formed  from  a  function  from  the 
second  level  of  the  MRA  in  the  x-direction  and  a 


function  from  the  second  level  of  the  MRA  in  the 
y-direction. 


Fig.  10.  Third  order  wavelet-like  basis  functions  from  the 
2"'*  and  3'“'  levels. 

B2(x)B3(y) 


Fig.  1 1 .  Linear  wavelet-like  basis  function  obtained  from  a 
2nd  level  x  and  2nd  level  y  function. 

Plots  of  several  two-dimensional  cubic  wavelet¬ 
like  basis  functions  are  shown  in  Figs.  12,  13,  14, 
and  15.  Bi(x,y)  is  a  cubic  wavelet-like  basis 
function  that  is  generated  from  the  first  level  in 
both  the  X-  and  y-directions  (Fig.  12).  As  can  be 
seen  from  the  figure,  this  function  is  nonzero 
over  most  of  the  domain.  It  is  also  a  piecewise 
cubic  polynomial  in  the  x-direction  and  a 
piecewise  cubic  polynomial  in  the  y-direction. 
B5(x,y)  and  Bio(x,y)  are  both  generated  from  the 
first  level  of  the  MRA  in  the  y-direction  and  the 
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second  level  of  the  MRA  in  the  x-direction  (Figs. 
13  and  14).  In  the  x-direction,  each  of  these  two 
functions  has  two  different  piecewise  cubic 
representations;  on  the  other  hand,  both  of  these 
functions  have  a  single  representation  in  the  y- 
direction.  Specifically,  in  the  x-direction,  there 
is  one  piecewise  cubic  representation  for  the 
segment  0  ^  x  ^  2.0 ,  and  another  piecewise 
cubic  representation  for  the  segment 
2.0  <  X  <  4.0  .  Bi5(x,y)  is  a  basis  function  that  is 
obtained  from  a  second  level  x-directed  function 
and  a  second  level  y-directed  function  (Fig.  15). 
This  function  has  two  different  piecewise  cubic 
representations  in  both  the  x-  and  y-directions. 

Bi(x.y) 


Fig.  12.  Cubic  wavelet-like  basis  function  obtained  from  a 
level  X  and  level  y  function. 

B5{x.y) 


Fig,  13.  Cubic  wavelet-like  basis  function  obtained  from  a 
2""*  level  X  and  level  y  function. 


Bio(x.y) 


Fig.  14.  Cubic  wavelet-like  basis  function  obtained  from  a 
2"^^  level  X  and  level  y  function. 


Bi5{x,y) 


Fig.  15.  Cubic  wavelet-like  basis  function  obtained  from  a 
2"‘*  level  X  and  2"*'  level  y  function. 

With  wavelet  analysis,  as  levels  in  the  MRA 
are  added,  the  wavelets  become  more  localized. 
As  can  be  seen  from  these  figures,  the  wavelet¬ 
like  basis  functions  also  possess  this  property; 
they  have  a  large  magnitude  in  a  smaller  portion 
of  the  domain  as  the  level  in  the  MRA  for  either 
(or  both)  the  x-  or  y-directions  increases.  Bi(x,y) 
has  a  rather  large  magnitude  over  the  entire 
domain.  Again,  B5(x,y)  and  Bio(x,y)  are  from 
the  second  level  in  the  x-direction  and  the  first 
level  in  the  y-direction;  notice  that  these  two 
functions  have  a  large  value  only  in  half  of  the 
region.  Bi5(x,y)  is  a  function  from  the  second 
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level  in  both  the  x-  and  y-directions  and  its  value 
is  large  only  in  one-quarter  of  the  domain. 

IV.  EXAMPLE  PROBLEM 

As  an  example  of  the  use  of  the  wavelet-like 
basis  functions,  consider  the  following 
differential  equation 

-V-{a{x,y)Vu{x,y))+b{x,y)u[x,y)  =  g{x,y)  (2) 

in  which  the  domain  is  the  rectangular  region 
from  x  =  0.0to  x  =  4.0  and  from  y  =  0.0  to 
y  =  3.0 .  Laplace’s  equation  can  be  obtained  by 
choosing  the  following:  a(x,y)  =  -\.0 , 

b{x,y)  =  0.0,  and  g(x,y)  =  0.0.  An  illustration 
of  the  problem  domain  along  with  the  boundary 
conditions  is  shown  in  Fig.  16. 

u(x,3.0)  =  sin(jtx/4) 

u(0.0,y)=0.0  u(4.0,y)=0.0 

_ u(x,0.0)=0.0 _ 

Fig.  16.  Problem  domain. 

Solutions  were  obtained  using  the  traditional 
two-dimensional  basis  functions,  two- 
dimensional  basis  functions  that  were  products  of 
linear  wavelet-like  basis  functions,  and  two- 
dimensional  basis  functions  that  were  products  of 
cubic  wavelet-like  basis  functions.  A 
comparison  of  the  analytic  solution  and  the 
numerical  solution,  which  was  found  using  961 
linear  wavelet-like  basis  functions,  is  made  along 
the  line  x  =  1.5  (Fig.  17).  The  numerical 
solution  in  this  case  corresponds  to  31  wavelet¬ 
like  functions  in  each  direction  (31*31=961  total 
basis  functions).  To  illustrate  the  accuracy  when 
the  linear  wavelet-like  basis  is  used,  the  curves 
for  the  numerical  solution  and  the  analytic 
solution  lie  on  top  of  each  other.  To  illustrate 
that  an  accurate  solution  is  also  obtained  when 
the  cubic  wavelet-like  basis  functions  is  used,  the 
analytic  solution  and  the  numerical  solution. 


which  was  obtained  with  55  cubic  wavelet-like 
basis  functions,  are  compared  along  the  line 
y=1.5  (Fig.  18).  Again,  these  two  curves  are 
indistinguishable  on  the  graph.  The  numerical 
solution  obtained  with  only  25  cubic  wavelet-like 
basis  functions  is  plotted  in  figure  19.  For  this 
graph,  five  cubic  wavelet-like  functions  in  each 
direction  were  used  as  the  two-dimensional 
basis.  As  expected,  very  few  cubic  basis 
functions  are  necessary  to  obtain  an  accurate 
solution.  From  these  figures,  it  is  seen  that  the 
solutions  obtained  are  accurate  when  either  cubic 
or  linear  wavelet-like  basis  functions  are  used. 

Although  the  ability  of  any  basis  function  to 
accurately  model  an  arbitrary  function  is  quite 
important,  the  wavelet-like  basis  also  has  other 
advantages.  Previously,  wavelet-like  functions 
have  been  shown  to  have  extremely  good 
convergence  and  stability  properties.  After 
diagonal  preconditioning,  the  condition  number 
of  the  system  matrix  was  calculated.  Figure  20 
illustrates  how  the  condition  number  varies  as 
the  number  of  basis  functions  is  increased. 
Because  the  condition  number  of  the  system 
matrix  is  much  smaller  for  the  wavelet-like  bases 
in  comparison  with  the  rapidly  rising  condition 
number  when  the  traditional  basis  is  used,  the 
condition  numbers  when  the  linear  and  cubic 
wavelet-like  basis  functions  are  used  are  shown 
separate  in  Fig.  21.  The  benefits  of  this  low 
condition  number  are  especially  evident  when 
looking  at  the  number  of  steps  required  for 
convergence  of  the  conjugate  gradient  method. 
In  Fig.  22,  the  number  of  steps  required  for 
convergence  of  the  conjugate  gradient  method  is 
plotted  as  the  number  of  basis  functions  is 
increased.  With  approximately  225  basis 
functions,  the  traditional  basis  requires  78  steps 
for  convergence;  this  is  in  contrast  to  only  34  for 
225  linear  wavelet-like  basis  functions  and  only 
18  for  253  cubic  wavelet-like  basis  functions. 
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Solution  Along  the  Line  x=1 .5 
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Fig.  17.  Numerical  (with  961  linear  2D  wavelet-like  basis) 
and  analytic  solutions  along  the  line  x  =  1.5. 

Solution  along  the  line  y=1.5 
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Fig.  18.  Numerical  and  analytic  solution  along  the  line 
y=1.5. 

U(x.y) 


Fig.  19.  Numerical  solution  when  25  cubic  wavelet-like 
functions  are  used. 


Condition  number 


Fig.  20.  Condition  number  comparison  of  wavelet-like  and 
traditional  basis. 


Condition  number 


Fig,  21 .  Condition  number  when  wavelet-like  bases  Are 
used. 


Steps  Required  for  Convergence 
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Fig.  22.  Steps  required  for  convergence. 


Hutchcraft  and  Gordon:  Two-Dimensional  Wavelet-Like  Basis  Functions 


V.  CONCLUSION 

It  has  been  shown  that  one-dimensional 
wavelet-like  basis  functions  can  be  multiplied  to 
obtain  two-dimensional  basis  functions  that  give 
accurate  results  when  they  are  used  to  obtain 
numerical  solutions.  The  stability  of  the 
condition  number  and  the  rapid  convergence 
when  the  conjugate  gradient  method  is  used  have 
also  been  shown  to  be  two  advantages  of  using 
either  linear  or  higher  order  wavelet-like  rather 
than  traditional  basis  functions.  As  is  the  case 
with  higher  order  traditional  basis  functions, 
fewer  cubic  wavelet-like  basis  functions  are 
required  for  high  accuracy.  One  disadvantage  of 
this  method  is  that  the  mesh  would  resemble 
more  of  a  finite  difference  mesh  rather  than  the 
triangular  patches  that  are  typically  associated 
with  the  finite  element  method;  however,  non- 
uniform  spacing  is  still  rather  easily 
accomplished  with  the  wavelet-like  method. 
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Abstract--  This  paper  investigates  some  aspects  on  noise  and 
vibrations  of  electrical  machinery  based  upon  the  coupling  between 
the  magnetic  field  and  the  mechanical  deformation  in  the  stator. 
This  coupling  is  typically  considered  by  using  reluctance  forces. 
Since  the  deformations  occurring  are  small  compared  to  the 
machine's  dimensions,  there  is  no  feedback  to  the  magnetic  system 
in  numerical  models.  However,  stator  deformations  are  caused  not 
only  by  reluctance  forces,  but  also  by  magnetostriction  effect  of  the 
stator  iron.  Magnetostriction  is  one  of  the  main  causes  of  noise  in 
electromagnetic  systems  particularly  when  the  flux  density  is  above 
1.5  Teslas.  Here,  we  develop  numerical  models  that  incorporate 
magnetostriction  effects  and  all  other  possible  electromechanical 
forces  and  related  material  interactions.  Magnetostriction  presents  a 
problem  at  all  levels  of  frequencies.  At  frequencies,  particularly  2E, 
magnetostrictive  forces  are  undesirable  and  can  be  large  as  well  as 
generate  acoustic  noise,  which  can  impede  the  system’s 
performance. 

The  magnetostrictive  deformations  can  be  calculated  based 
upon  the  magnetic  field.  If  the  magnetostrictive  deformations  are 
slightly  higher  than  the  magnitude  of  the  deformations  caused  by 
the  reluctance  forces,  there  will  be  a  need  for  feedback  to  the 
magnetic  system.  In  order  to  account  for  this  effect,  the 
magnetostriction  characteristic  of  iron  >.(H)  is  needed.  The 
dependency  of  permeability  on  mechanical  stress  must  be  accounted 
for  and  be  built  into  a  strong  coupling  scheme.  Implementation 
results  on  a  2-hp,  permanent  magnet  motor  indicate  that 
magnetostrictive  forces  are  significant  and  must  be  accounted  for  in 
the  electromagnetic  system’s  design  stage. 

Index  Terms —  Coupled  problems,  transient  FE 
formulation,  magnetoelastic  problem,  Magnetostriction. 

L  INTRODUCTION 

A  numerical  2-D  tensor  finite  element  model  for 
coupled  magneto-mechanical  problems  is  presented. 
The  developed  model  can  be  used  to  evaluate  force 
components  in  electromagnetic  devices  including  those 
due  to  the  magnetostrictive  phenomenon. 
Magnetostriction  is  a  property  of  ferrous  alloys  in 
\vhich  the  material  will  exhibit  strain  in  the  presence  of 
magnetic  field.  This  strain  is  in  addition  to  any  other 
strains  that  might  exist  as  a  result  of  electrical  and/or 
mechanical  forces  in  devices.  At  low  frequencies, 
magnetostrictive  forces  are  undesirable.  They  can  be 
large  and  generate  acoustic  noise  in  electromagnetic 
systems  [1]. 


Magnetostriction  is  built  into  the  coupled  system 
using  a  force  distribution  that  is  added  to  magnetic 
forces.  Magnetostriction  forces  mean  the  set  of  forces 
that  induce  the  same  strain  in  the  material  as 
magnetostriction  does.  The  numerical  implementation 
of  this  approach  is  similar  to  the  way  thermal  stresses 
are  applied.  To  evaluate  thermal  stresses,  the  thermal 
expansion  of  the  free  body  is  calculated  based  upon  the 
temperature  distribution  [2],  The  thermal  stresses  are 
found  by  deforming  the  expanded  body  back  into  its 
original  shape.  Magnetostriction  forces  are  calculated 
by  expansion  of  the  free  body  due  to  magnetostriction 
based  upon  the  magnetic  flux  density.  The 
magnetostriction  forces  are  found  as  the  reaction  to  the 
forces  needed  to  deform  the  expanded  body  back  into 
the  original  boundary  conditions. 

In  finite  element  models,  this  can  be  performed  for 
each  element  through  the  element’s  nodal 
displacements  with  respect  to  the  element’s  centre  point 
using  the  element's  flux  density  and  the 
magnetostriction  characteristic  of  the  material,  X(H). 
The  element’s  strains  are  converted  into  nodal 
displacements  considering  the  center  of  the  element  as 
a  fixed  point. 

Magnetostriction  in  magnetic  materials,  such  as 
electrical  silicon  steel  sheets,  is  one  of  the  main  causes 
of  noise  and  vibration  in  electric  machinery  and 
equipment  for  which  there  is  no  proven  remedy.  It  is  an 
important  factor  in  governing  the  magnetic  properties 
of  magnetic  materials.  In  order  to  achieve  lower  losses 
and  noise  levels  in  electrical  machines,  we  need  to 
know  the  magnetostrictive  effects  in  the  design  stage. 
Some  studies  have  been  done  on  the  reduction  of 
magnetostriction  and  in  orienting  the  steel  sheet  in  the 
same  direction  as  the  applied  tension  to  generate  180° 
domains  in  the  direction.  Even  with  such  an  effort,  the 
improvements  in  noise  and  loss  levels  in  actual 
equipment  are  only  marginal  especially  under  rotating 
field  conditions  where  the  effects  of  magnetostriction 
are  greater.  If  the  magnetic  flux  changes  direction  or  is 
rotating,  B  and  H  will  not  be  parallel  and  therefore 
tensor  values  of  magnetic  reluctivities  should  replace 
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the  single  values.  Furthermore,  it  is  possible  that  the 
magnetostriction  itself  or  the  accompanying  material 
property  changes,  may  be  small  in  one  direction,  they 
can  significantly  vary  when  the  domain  structure 
changes  in  the  magnetic  flux  direction  [3,4]. 

Magnetization  characteristics  data  available  from 
electrical  steel  manufacturers  today  are  usually 
available  for  lower  field  levels  (typically  around  1.5  T 
in  test  samples)  and  such  data  do  not  exist  under  any 
stress  level.  The  magnetic  field  causes  elastic 
defonnation  and  the  mechanical  stress  changes  the 
magnetization  curve  and  the  hysteresis  loop.  An  initial 
effort  on  magnetostriction  measurement  was  conducted 
on  test  samples  to  obtain  magnetization  curves  under 
several  stress  levels  were  obtained  [5,  6].  The  obtained 
curves  are  subsequently  used  in  the  numerical  models 
described  here.  Implementation  results  on  a  2-hp, 
permanent  magnet  motor  indicate  that  magnetostrictive 
forces  are  significant  and  amount  to  more  than  50% 
force  level  increase. 

II.  DEVELOPMENT  OF  THE  NUMERICAL 
MODEL 

Implementing  the  finite  element  discertization,  we 
can  obtain  the  numerical  solution  of  Maxwell’s 
equations.  As  a  result  of  the  finite  element 
discertization,  the  following  matrix  equations  for  both 
the  electromagnetic  and  mechanical  problems  obtained 
[1]: 


[s]M  =  k] 

[K][t/]  =  [F]  (2) 

where  S  and  K  are  the  eleetromagnetie  and  mechanical 
stiffness  matrices  and  F  is  the  force  vector. 

Solving  equations  (1)  and  (2)  yields  the  magnetic 
vector  potential,  A  and  the  displacement  U.  One  could 
easily  notice  that  equations  (1)  and  (2)  are  coupled 
through;  a)  the  magnetic  force  and,  b)  the  variation  of 
the  material  permeability  with  stress  [4-6].  Therefore  a 
coupled  solution  model  is  necessary.  Utilizing  the 
coupled  solution,  the  magnetic  force  in  the  direction  of 
the  displacement,  U  is  equal  to  the  variation  of  the 
magnetic  energy  with  respect  to  U  while  the  magnetic 
flux  is  held  constant.  When  implementing  the  virtual 
work  method  [7],  holding  the  magnetic  flux  constant  is 
done  by  holding  the  difference  of  the  magnetic  vector 
potential  between  two  nodes  constant.  In  one  element. 


the  contribution  to  the  force  acting  on  a  given  node  is 
as  follows: 


Re  5C/  i 


dU 


J  OCT  OU 

where  Re  is  the  element  area,  \G\  is  the  determinant  the 
local  Jacobian  derivatives  matrix  G. 

The  variation  of  the  permeability  with  mechanical 
stress  is  taken  into  consideration  in  the  third  term  of 
equation  (3).  This  corresponds  to  the  force  component 
of  magnetostriction  origin.  The  third  term  in  equation 
(3)  is  the  force  component  due  to  the  magnetostriction 
phenomenon  in  one  element  acting  on  a  given  node  is 
re-written  as: 


Re  0 


da 


(l  +  u)(l-2u) 


)  |Gl]  dxdy 

(4) 


The  total  effect  on  a  given  part  in  the  device  can  be 
calculated  by  adding  the  elemental  contributions.  It 
should  be  noticed  that  in  evaluating  the  force  equations, 
the  permeability  changes  with  stress  must  be  taken  into 
consideration  in  computing  the  field.  This  equation, 
equation  (3),  gives  only  the  force  component  due  to 
magnetostriction  in  one  element.  All  other  forces 
(electromagnetic,  mechanical,  etc.)  must  be  calculated 
in  the  standard  way  given  by  the  problem. 

For  the  full  reluctivity  tensor  model,  the 
magnetostrictive  force  term  with  tensor  reluctivity  is 
written  as  follows  [1,5,  6]: 


where  E  is  the  Young's  modulus  and  u  is  the  Poisson's 
ratio.  |G|  is  the  determinant  of  the  local  Jacobian 
matrix  [1]. 

Results  from  an  initial  effort  [1,  5,  6]  on  the 
measurement  of  magnetization  curves  under 
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mechanical  stress  were  used  in  the  implementation  to 
the  2-hp  motor  given  here.  From  the  measurements  and 
at  a  given  value  of  flux  density,  within  the  proportional 
limit  of  ferrous  materials,  the  permeability  will  increase 
as  the  applied  stress  increases.  However,  a  reduction  in 
the  material  permeability  will  occur  as  the  stress 
increases  further.  The  reduction  in  permeability  will 
continue  as  the  stress  on  the  material  passes  through  the 
elastic  limit.  A  more  pronounced  reduction  in 
permeability  would  occur  as  the  increase  in  stress  takes 
the  material  beyond  the  elastic  limit  (plastic  range  up  to 
the  yield  point).  At  each  level  of  stress,  permeability 
versus  magnetic  field  will  vaiy.  This  would  also  mean 
that  the  hysteresis  loop  would  shift  affecting  the 
behavior  of  the  device  and  cause  a  noticeable  and/or  a 
permanent  change  in  performance. 

III.  IMPLEMENTATION  AND  RESULTS 

The  numerical  model  was  implemented  on  a  6- 
pole,  three-phase,  2-hp  surface  mounted  permanent 
magnet  motor.  One  pole  of  this  motor  is  shown  in 
figure  (1).  The  figure  also  shows  the  winding 
arrangement  in  the  double-layered  slots.  Three-phase 
excitation  is  applied  to  the  winding  and  the  half  slot 


current  density  in  the  windings.  A  360-degree  model 
for  the  motor  was  used  in  developing  the  strongly 
coupled  transient  solution.  The  transient  solution  was 
obtained  by  rotating  the  rotor  every  quarter  degree  for  a 
whole  pole-pitch. 


Fig.  (1)  One  pole  of  the  2-hp  Motor 


Fig.  (2)  Flux  density  on  deformed  shape  from  transient  solution  without  Magnetostrictive  forces. 
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Sample  flux  density  solutions  at  various  rotor 
positions  are  shown  in  figure  (2).  The  measured 
magnetization  curves  under  stress  levels  were  used  in 
the  numerical  model.  Magnetostriction  was  included 
by  utilizing  the  curves  and  the  stress  levels  obtained 
from  the  numerical  solution  and  a  table  look-up 


procedure  incorporating  the  measured  curves.  Figure 
(2)  shows  the  flux  density  on  the  deformed  shape  at 
six  instances  from  the  transient  solution  without  the 
magnetostriction  forces  included.  On  the  other  hand, 
figure  (3)  shows  the  flux  density  on  the  deformed 
shape  with  the  magnetostriction  forces  added. 


Fig.  (3)  Flux  density  on  deformed  shape  from  transient  solution  with  Magnetostrictive  forces. 


Sample  flux  density  solutions  at  various  rotor 
positions  are  shown  in  figure  (2).  The  measured 
magnetization  curves  under  stress  levels  were  used  in 
the  numerical  model.  Magnetostriction  was  included  by 
utilizing  the  curves  and  the  stress  levels  obtained  from 
the  numerical  solution  and  a  table  look-up  procedure 
incorporating  the  measured  curves,  figure  (2)  shows 
the  flux  density  on  the  deformed  shape  at  six  instances 
from  the  transient  solution  without  the  magnetostriction 
forces  included.  On  the  other  hand,  Figure  (3)  shows 
the  flux  density  on  the  deformed  shape  with  the 
magnetostriction  forces  added. 

The  nodal  forces  at  several  nodes  on  the  stator  are 
shown  in  figures  (4)  and  (5).  It  can  be  easily  seen  from 


these  two  figures  that  the  magnetostriction  force  values 
are  significant  and  include  additional  pulsation  as  an 
indication  that  the  magnetostriction  effect  adds  more 
noise.  These  magnetostriction  forces  are  significant  and 
cause  additional  deformations  in  the  iron  and  contribute 
added  noise.  These  forces  represent  nearly  50%  force 
value  increase  due  to  magnetostriction.  This  is 
significant  and  must  be  accounted  for  in  the  design  and 
analysis  stages  in  electric  machinery.  Furthermore,  the 
torque  pulsation  has  increased  as  can  be  depicted  in 
figure  (6).  This  figure  shows  the  torque  from  the 
transient  solution  with  and  without  magnetostriction. 


Force  (N) 
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IV.  Conclusion 

We  presented  an  investigation  on  the  causes  of 
noise  and  vibrations  of  electrical  machinery  based  upon 
a  strongly  coupled  numerical  model  comprising 
magnetic  field  and  the  mechanical  deformation  in  the 
stator.  The  deformations  in  the  stator  are  caused  not 
only  by  reluctance  forces,  but  also  by  magnetostriction 
effect  of  the  stator  iron.  We  noticed  an  increase  in  the 
deformations  in  the  back  iron  and  the  force  values 
causing  it. 

Measured  data  on  magnetization  curves  under 
stress  levels  were  used  in  obtaining  the  nonlinear 
transient  numerical  solutions  presented  here.  A 
feedback  to  the  magnetic  system  was  performed 
utilizing  the  dependence  of  permeability  on  mechanical 
stress  from  the  measured  data.  In  order  to  account  for 
this  effect,  the  magnetostriction  characteristic  of  iron 
A.(H)  was  used.  The  dependency  of  permeability  on 
mechanical  stress  was  accounted  for  and  built  into  the 
strong  coupling  scheme.  Implementation  results  on  a  1- 
hp,  permanent  magnet  motor  indicate  that 
magnetostrictive  forces  are  significant  and  must  be 
considered  design  stage. 
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Abstract:  The  design,  construction  and  testing  of 
new  hybrid  junctions  to  perform  the  usual 
functions  of  summation  and  subtraction  of 
incoming  signals,  not  only  over  the  fundamental 
mode  but  also  to  include  the  first  few  higher 
order  modes,  are  important  tasks  for  both  military 
and  civilian  applications.  The  performance  of  the 
proposed  junctions  is  evaluated  experimentally 
and  their  relevance  for  a  variety  of  microwave 
measurements  and  tracking  radars  suggested. 

Keywords:  Microwave  Hybrid  Junctions,  Radar, 
Higher  Order  Modes 

1.  Introduction 

Hybrid  junctions  or  combining  circuits 
are  primarily  magic  tees,  hybrid  rings  and 
directional  couplers  that  are  employed  in  a 
variety  of  electromagnetic  engineering 
applications  for  duplexing  and  mixing  of  signals 
as  well  as  impedance  measurements.  They  are 
also  used  for  signal  sampling  as  well  as 
amplitude-phase  bridge  networks  for  on-line 
monitoring  and  quality  control  of  physical  and 
dielectric  properties  of  sheet  materials  during  the 
manufacturing  and  processing  stages  [1-2]. 

The  operation  of  these  junctions  is  based 
on  the  principle  that  analog  signals  incident  at 
some  ports  are  added  in  or  out  of  phase  at  other 
“sum”  and  “difference”  ports,  respectively.  Thus 
it  is  possible  to  redirect  signals  to  some  ports  and 


isolate  others.  This  is  particularly  important  in 
radar  systems  as  will  be  discussed  later. 

Applying  the  reciprocity  principle  to  the 
four  port  magic  tee  shown  schematically  in  Fig. 
la,  we  see  how  signals  incoming  at  the  collinear 
ports  3  and  4  add  in  phase  at  port  1  (parallel  arm 
or  P  arm)  and  out  of  phase  at  port  2  (shunt  arm  or 
S  arm).  Conversely,  a  signal  fed  into  port  1  splits 
equally  between  ports  3  and  4  with  the  outputs 
equal  in  amplitude  and  opposite  in  phase.  On  the 
other  hand,  a  signal  fed  into  port  2  also  splits 
between  ports  3  and  4  with  the  outputs  equal  in 
amplitude  and  phase.  Thus  two  transmitters  with 
equal  power  connected  to  ports  2  and  1  will 
produce  twice  as  much  power  at  port  4  and  none 
at  port  3. 

The  hybrid  ring  shown  in  Fig.  lb  operates 
on  similar  principles  where  signals  add  in  phase 
if  they  are  an  even  number  of  half  waveguide 
wavelengths  (A-g/2)  apart  and  out  of  phase  if  they 
are  an  odd  number  of  half  waveguide 
wavelengths  (Xg/2)  apart.  With  a  signal  fed  into 
port  1,  two  outputs  with  equal  amplitude  and 
opposite  phase  will  appear  at  ports  3  and  4  and 
zero  output  at  port  2.  On  the  other  hand,  a  signal 
fed  into  port  2  produces  two  signals  of  equal 
amplitude  and  phase  at  ports  3  and  4  and  cancels 
out  at  port  1 .  Also  a  signal  fed  into  port  3  splits 
with  equal  amplitude  and  phase  at  ports  2  and  1, 
and  no  signal  appears  at  port  4.  Similarly,  a 
signal  fed  into  port  4  splits  with  equal  and 
opposite  phase  at  ports  1  and  2  and  no  signal 
appears  at  port  3. 
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Figure  1.  Schematic  Diagram  of  Four-Port  Hybrid  Junctions, 
(a)  Magic  Tee  (b)  Hybrid  Ring  (c)  Directional  Coupler 


waves 

add 


The  two  hole  directional  coupler  shown  in 
Fig.  Ic  has  two  identical  sampling  holes  between 
the  primary  and  secondary  waveguides  so  that 
forward  signals  add  in  phase  at  port  4  and  cancel 
in  port  2,  and  so  on. 

The  disadvantages  of  all  three  hybrid 
junctions  in  the  rectangular  waveguides  shown  in 
Fig.  1  is  the  residual  or  leakage  coupling  between 
the  collinear  arms  which  are  supposed  to  be 
isolated  from  each  other  at  the  design  frequency 
or  within  the  passband  of  the  waveguide.  At 
frequencies  higher  than  the  design  frequency,  but 
still  above  the  cutoff  frequency  fc  of  the 
fundamental  TE  or  TM  mode  of  the  waveguide, 
this  coupling  increases  so  dramatically  that  the 
“magic  tee”  loses  its  “magic  power”.  In  such 


cases,  higher  order  modes  are  excited  in  all  three 
types  of  hybrid  Jxmctions,  and  a  solution  needs  to 
be  found  for  both  rectangular  and  circular 
waveguide  hybrid  junctions. 

The  problem  of  redesigning  hybrid  junctions  to 
perform  analog  sum  and  difference  signals  over 
the  fundamental  as  well  as  higher  order  modes  is 
very  critical  in  certain  types  of  tracking  radars 
which  are  described  in  detail  elsewhere  [3].  As  a 
review  example,  the  conventional  amplitude- 
comparison  monopulse  tracking  radar  for  a  single 
angle  coordinate  employs  two  squinted  antenna 
feeds  which  are  connected  to  a  magic  tee  so  that 
the  sum  (S)  and  difference  (A)  signals  appear  at 
the  two  output  ports  as  shown  in  Fig.  2,  [3].  On 
reception,  the  outputs  of  the  sum  and  difference 


Figure  2.  Schematic  Diagram  of  Two-Coordinate  (Azimuth 
and  Elevation)  Amplitude-Comparison  Monopulse  Tracking 
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ports  are  each  heterodyned  to  an  intermediate 
frequency  (IF)  and  amplified  in  a  super 
heterodyne  receiver  where  the  local  oscillator 
(LO)  is  shared  by  the  two  channels  which  are 
maintained  with  the  same  amplitude  and  phase 
characteristics.  The  transmitter  is  connected  to 
the  sum  port  of  the  hybrid  junction  while  a 
duplexer  is  inserted  in  the  sum  channel  to  protect 
the  sum  channel  receiver.  The  outputs  of  the  sum 
and  difference  channels  are  inputted  to  a  phase 
sensitive  detector  which  is  a  nonlinear  device  that 
compares  the  two  signals  at  the  same  frequency. 
The  output  of  this  detector  is  the  angle-error 
signal  0q  whose  magnitude  is  proportional  to  the 
difference  between  the  target  and  boresight 
angles,  while  the  polarity  of  the  output  indicates 
the  direction  of  the  angle  error  relative  to  the 
boresight.  This  technique  is  extended  to  two- 
angle  coordinate  measurement  by  employing  a 
cluster  of  four  (A,  B,  C  and  D)  feed  horns 
(feeding  a  Cassegrain  dish  or  space-fed  phased 
array)  which  generate  four  partially  overlapping 
(squinted)  beams  as  shown  schematically  in  Fig. 
2.  The  sum  pattern  is  found  from  (A+B+C+D) 
while  the  azimuth  difference  pattern  is  obtained 
from  (A+B)  -  (C+D)  and  for  the  elevation  plane 
the  difference  pattern  is  found  from  (B-t-D)  - 
(A+C).  Range  information  is  extracted  from  the 
output  of  the  sum  channel  after  envelope 
detection  and  is  employed  for  range  tracking. 

Although  the  four-horn  feed  system  is 
simple,  it  cannot  provide  the  sum  and  difference 
patterns  that  are  independently  optimized.  The 
sum  pattern  should  have  maximum  gain  on  axis 
which  requires  uniform  aperture  illumination.  On 
the  other  hand,  the  difference  pattern  should  have 
an  aperture  illumination  which  results  in  a  large 
slope  of  the  error  signal  at  the  beam  crossover. 
For  this,  some  improvement  has  been 
incorporated  in  some  precision  tracking  radars  by 
using  a  five-horn  feed  consisting  of  one  horn  in 
the  middle  to  produce  the  sum  pattern  surrounded 
by  four  horns  to  generate  the  difference  patterns. 
The  analysis  in  the  literature  indicates  that  the 
sizes  of  the  four  horns  generating  the  difference 
pattern  should  be  twice  as  big  as  the  central  horn 


generating  the  sum  pattern.  Instead  of  further 
improvement  by  adopting  a  twelve-horn  design 
to  satisfy  the  optimization  criteria  and  avoid 
complexity,  it  has  been  the  practice  in  many 
types  of  radar  to  use  higher  order  waveguide 
modes  to  obtain  better  control  of  the  sum  and 
difference  patterns.  An  example  of  such  systems 
is  the  AN/MPS-39  for  multiple  missile  tracking 
or  the  MOTR  (Multiple  Object  Tracking  Radar) 
which  employ  a  four-horn  triple-mode  feed.  For 
these  multimode  feed  systems  and  for  other 
systems,  which  employ  multiple  frequencies  or 
frequency  agility/diversity  to  smooth  out  the 
angle  error  due  to  glint,  it  is  necessary  to  develop 
new  multimode  hybrid  junctions  to  produce  the 
sum  and  difference  patterns  which  is  the  main 
purpose  of  this  paper. 

2.  Outline  of  Proposed  Hybrid  Junctions 

The  primary  objective  of  this  project  is  to 
design  and  develop  new  types  of  multimode 
hybrid  junctions  in  rectangular  and  circular 
waveguides  which  permit  the  extraction  of 
separate  sum  and  difference  signals  for  the 
fundamental  as  well  as  higher  order  TE  and  TM 
modes  in  multimode  signals.  These  are  shown 
schematically  in  Fig.  3  for  rectangular  and 
circular  waveguides.  The  basic  approach  is  to  use 
cascaded  hybrid  junctions,  one  for  each  mode  of 
interest,  so  that  the  sum  and  difference  signals 
can  be  displayed  and  processed  separately  for 
each  mode.  Starting  with  Fig.  3a,  we  see  that  the 
collinear  arms  of  the  magic  tee  in  the  primary 
rectangular  waveguide  accommodate  the 
fundamental  as  well  as  all  the  higher  order  modes 
excited  above  frequencies  fo  relative  to  (fcjxE  lo  of 
the  fundamental  mode.  A  second  mode  magic  tee 
is  inserted  in  cascade  with  its  collinear  arms 
coupled  to  the  primary  waveguide  so  as  to  extract 
the  second  and  all  higher  order  modes,  but  not 
the  primary  mode,  since  its  dimensions  are  such 
that  the  primary  mode  is  below  cutoff.  Only  the 
second  mode  is  processed  in  the  second  mode 
magic  tee  since  the  third  and  higher  order  modes 
are  extracted  in  the  third  mode  magic  tee  prior  to 
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H-plane  dimencions  foxUnp«r»!l*l(^  eumnungann*  ofth«m»gict*ef 

Some  E'plsnt  dimensions  for  the  seiies  ^  difTerence  ams  of  the  magic  tees  maybe  eqiut 
depending  on  the  mode  selected.  The  difference  aims  are  perpendicular  to  the  plane  of  the  paper. 


H-plane  dimensions  aj>  aj>  t.>  a^  for  the  parallel  summing  arms  of  the  hybrid  junctions. 

Some  E-plane  dimensions  for  the  series  difference  arms  of  the  hybrid  junctions 

maybe  equal,  depending  on  the  mode  selected. 


Figure  3.a.  Cascaded  Mode-Selective  Rectangular  WG  Magic  Tees 


Figure  3.c.  Cascaded  Mode-Selective  Hybrid  Junctions 


Diameters  aj>  a^  provide  selection  ofsuccessively  higher  modes  in  a  manner 
similar  to  cascading  hi^pass  filters  with  successively  higher  cuUofF frequencies. 


H-plane  dimensions  for  the  parallel  (P),  summing  arms  of  the  hybrid  junctions. 

Some  E-plane  dimensions  for  the  seiies  difference  arms  of  the  hybrid  junctions 
maybe  equal,  depending  on  the  mode  selected. 


Figure  3.b.  Cascaded  Mode-Selective  Circular  WG  Magic  Tees 


Figure  3.d.  Cascaded  Mode-Selective  Directional  Couplers 
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Figure  4.  Relative  Cutoff  Frequencies  of  Higher  Order  Modes  in  Waveguides 


processing,  etc.  It  is  obvious  that  the  waveguide 
dimensions  decrease  as  the  selected  mode  to  the 
primary  waveguide  so  as  to  extract  the  second 
and  all  higher  order  modes,  but  not  the  primary 
mode,  since  its  dimensions  are  such  that  the 
primary  mode  is  below  cutoff.  Only  the  second 
mode  is  processed  in  the  second  mode  magic  tee 
since  the  third  and  higher  order  modes  are 
extracted  in  the  third  mode  magic  tee  prior  to 


processing,  etc.  It  is  obvious  that  the  waveguide 
dimensions  decrease  as  the  selected  mode 
number  increases  so  that  the  procedure  will  be 
limited  to  only  the  first  few  modes  to  avoid 
network  complexity.  The  actual  modes  to  be 
selected  depend  on  the  usual  rectangular 
waveguide  dimensions  (a  and  b)  as  illustrated  in 
Fig.  4a  while  Fig.  4b  shows  a  similar  spectrum 
for  circular  waveguide  modes  to  which  the 
technique  will  be  extended  later.  The  initial 
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Figure  5.  Schematic  Diagram  of  Adjustable  Length  Rectangular  Waveguide 
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design  will  therefore  concentrate  on  the 
rectangular  ^vaveguide  dimensions  (a  and  b)  of 
each  magic  tee  such  that  all  few  modes  of  interest 
are  excited  within  the  available  frequency  band 
of  the  sweep  oscillator. 

One  drawback  of  the  proposed  technique  is  the 
effect  of  the  added  junctions  on  the  voltage 
standing  wave  ratio  (VSWR)  in  the  primary  and 
branch  waveguides  of  the  multimode  hybrid 
Junctions  [3].  Although  slide  screw  tuners  as 
well  as  rotary  vane,  ferrite  or  other  types  of  phase 
shifters  can  be  used  to  minimize  the  VSWR,  an 
alternative  method  is  proposed  to  avoid  heating 
of  tuning  elements  or  power  leakage  from  rotary 
vane  slots,  particularly  in  high  power  radar 
applications.  This  is  shown  schematically  in  Fig. 
5  where  the  waveguide  length  is  varied  by  a 
sliding  mechanical  technique  which  provides 
adequate  phase  shift.  A  threaded  sleeve,  which  is 
turned  using  two  outer  rings  attached  to  a  circular 
pipe,  introduces  spacing  between  two  adjacent 
waveguides  which  slide  inside  an  outer 
waveguide  provided  for  that  purpose.  This 
adjustable  waveguide  length  method  is  novel  and 
useful  for  testing  the  proposed  hybrid  junctions 
for  each  mode. 


3.  Experimental  Procedure  and  Results 

The  three  hybrid  junctions  were 
constructed  and  tested  only  for  the  fundamental 
TEio  as  well  the  first  higher  TE20  modes  only. 
The  testing  procedure  employed  a  2-18.6  GHz 
Hewlett  Packard  sweep  oscillator  whose  output 
was  displayed  on  a  spectrum  analyzer  to 
determine  the  amplitude  of  each  mode  excited  in 
the  passband  of  the  main  feed  waveguide.  This 
output  was  then  fed  into  the  collinear  arms  of 
each  hybrid  junction  and  the  signals  at  the  sum 
and  difference  arms  of  each  junction  were  then 
measured  and  compared  with  the  expected 
values  of  the  higher  order  mode.  Slotted 
waveguides  and  two-way  directional  couplers 
were  inserted  in  series  with  the  collinear  arms  of 
each  junction  to  measure  the  VSWR  with  and 


without  the  adjustable  length  waveguide.  The 
main  test  frequencies  were  3,  6,  9  and  18  GHz, 
although  secondary  frequencies  were  employed 
as  well.  The  maximum  VSWR  observed  was 
1.12  while  the  maximum  deviation  of  the  TE20 
mode  amplitude  from  the  expected  value  was 
7.6%  for  the  rectangular  magic  tee,  9.1%  for  the 
hybrid  rind  and  6.9%  for  the  directional  coupler. 

4.  Conclusions 

New  hybrid  junctions  were  designed, 
constructed  and  tested  for  the  first  two 
propagating  modes  in  rectangular  waveguides  for 
standard  laboratory  measurements  as  well  as 
radar  applications.  The  design  of  the  various 
waveguide  sections  will  be  optimized  in  the 
future  using  the  method  of  moments  [4]  while  the 
proposed  variable-length  waveguide  proved 
helpful  in  the  proposed  hybrid  junctions  to 
minimize  mismatch  losses.  The  extension  of 
multimode  hybrid  junctions  to  circular 
waveguides  will  be  investigated  separately  once 
the  rectangular  waveguide  multimode  junctions 
are  tested  over  several  modes. 
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Abstract 

Analytical  solution  is  derived  to  the  problem  of  scattering  of 
electromagnetic  plane  wave  by  an  array  of  dielectric  spheres 
each  coated  with  a  dielectric  shell.  The  incident,  scattered 
and  transmitted  electric  and  magnetic  fields  are  expressed  in 
terms  of  the  vector  spherical  wave  ilinctions.  The  vector 
spherical  translation  addition  theorem  is  applied  to  impose 
the  boundary  conditions  on  the  surface  of  various  layers. 
Numerical  results  are  computed  and  presented  graphically 
for  the  radar  cross  sections  of  several  configurations  of 
spheres  system  with  multi  dielectric  layers. 

1.  Introduction 

Many  authors  have  studied  the  scattering  of  electromagnetic 
plane  wave  by  a  dielectric  sphere  coated  with  a  dielectric 
shell.  Aden  and  Kerker  [1]  obtained  analytical  expressions 
to  the  scattering  of  electromagnetic  plane  wave  by  a 
dielectric  sphere  coated  with  a  concentric  spherical  shell  of 
different  dielectric  materials,  while  Scharfman  [2]  presented 
numerical  results  for  the  special  case  of  a  small  electrical 
radius  (ka<l)  dielectric  coated  conducting  sphere.  It  was 
found  in  those  early  studies  that  the  presence  of  dielectric 
coatings  leads  to  substantial  increase  in  the  backscattering 
cross  section  for  an  appropriate  choice  of  the  dielectric 
constant  and  thickness  of  the  coating  relative  to  that  of 
uncoated  sphere.  Further,  Wait  has  extended  the  solution  to 
the  case  of  scattering  by  a  radially  inhomogeneous  sphere 
[3],  while  a  numerical  solution  using  the  method  of 
moments  obtained  by  Medgyesi-Mitschang  and  Putnam  for 
the  case  of  dielectric-coated  concentric  sphere  [4].  More 
recently,  an  exact  solution  of  electromagnetic  plane  wave 
scattering  by  an  eccentric  multilayered  sphere  was 
developed  by  Lim  and  Lee  [5].  Numerous  papers  on  the 
scattering  from  systems  of  spheres  of  various  natures  in 
close  proximity  have  been  treated  by  numerous  researchers 
[6-11]. 

Up  to  now,  there  has  been  no  analytical  or  numerical 
solution  to  the  problem  of  scattering  of  electromagnetic 
plane  wave  by  an  array  of  conducting  spheres  each  coated 


with  a  dielectric  layer.  In  this  paper,  we  extend  the  solution 
of  scattering  by  two  dielectric  spheres  covered  with  a 
dielectric  shell  [9]  to  the  case  of  scattering  by  a  system  of 
dielectric  spheres  each  covered  with  a  dielectric  shell.  The 
solution  to  this  problem  has  many  practical  applications 
since,  for  example,  it  may  be  used  to  study  the  scattering  by 
complex  objects  simulated  by  a  collection  of  spheres  [12], 
and  it  may  also  be  used  to  check  the  accuracy  of  numerical 
solutions. 

From  the  design  point  of  view,  the  backscattering  cross 
section  of  an  array  of  N  dielectric  coated  spheres  can  be 
controlled  to  exploit  multiple  resonances  by  optimizing  the 
multivariables  of  the  system.  These  include  the  size  and 
location  of  each  sphere,  number  of  dielectric  layers  coating 
each  sphere  as  well  as  the  thickness  and  relative  dielectric 
constant  of  each  layer  as  already  done  for  conducting 
cylinders  [13]. 

2.  Formulation  of  the  Problem 


Consider  a  linear  array  of  N  dielectric  spheres  each  coated 
with  a  dielectric  shell  and  having  different  radii  and  unequal 
spacing  with  centers  lying  along  the  z  axis,  as  shown  in  Fig. 
1 .  Electromagnetic  plane  wave  of  unit  electric  field 
intensity,  whose  propagation  vector  K  lies  in  the  x-z  plane 
and  makes  an  angles  with  the  z-axis,  is  assumed  to  be 
incident  on  the  spheres.  Its  incident  electric  and  magnetic 
fields  are 


//j  =  “  —  (cos ax-sinaz) 


(1) 

(2) 


with  k  being  the  wave  number,  and  z  are  the  unit 


vectors  along  the  x,  y  and  z  axes,  respectively,  and  tj  is  the 
surrounding  medium  intrinsic  impedance.  The  incident 
electric  and  magnetic  fields  may  be  expanded  in  terms  of 
spherical  vector  wave  functions  around  the  center  of  the  p'** 
sphere  as 
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oo  m=n  _ 

nH, (.r^,0p ,(l>p)  =  j^Yj  (4) 

"  ■^Qpi’n,n)WJl{r^,e^,(l>^)] 

where  and  N^„  are  the  spherical  vector  wave 

functions  of  the  first  kind  representing  incoming  waves 
associated  with  the  spherical  Bessel  function,  while 
Pp{m,n)  and  Qp{m,n)  are  the  incident  field  expansion 

coefficients  defined  in  [7-8,14].  The  field  in  the  region  II 
can  be  also  expressed  in  terms  of  the  vector  spherical  wave 
functions  of  the  first  and  third  kinds.  Hence  the  electric  and 
magnetic  fields  may  be  wntten  as 

<»  m=n  _ 

+  A'Ep(m,n)Jf^^(r^,9^,(l>^) 

+Amp  (m,  n)M2l  (t>  ) 

+  A\p(m,ri)M  (r^ ,  )] 

oo  m=n  _ 

+  A'Ep(m,n)M2l{r^fi^,<l>^) 

+A'mp  (m,n)JV^^ 

+ A’ Mp(m,n)N2i!  ir^,0p,(p^ )] 

where  ApE(f^hn),  A  pM  A  pE  and 

A  pM  (f^hn)  are  the  field  expansion  coefficients,  while 

and  are  the  vector  spherical  wave  ftinctions  of 
the  third  kind  representing  outgoing  waves  associated  with 
the  spherical  Hankel  function.  The  subscripts  E  and  M 
denote  transverse  magnetic  (TM)  and  transverse  electric 
waves  (TE),  respectively.  The  field  in  region  I  of  the  pth 
sphere  may  be  written  in  terms  of  the  vector  wave  functions 
of  the  first  kind,  i.e., 

»>  m=«  _ 

E,{rp,0p,<f>p)  =  'Z  Z  (7) 

+  A^p{m,n)M  (/'^  ,(9^ 

oo  m=n  _ 

n=\  m=-n  _ 

+A^p(m,n)N2^„(r^,0^,^^)] 


where  A^^p  and  Aj^p  are  the  unknown  transmitted 

coefficients.  Finally,  the  scattered  electric  and  magnetic 
fields  from  the  p^^  sphere  are  expanded  as 


OP  m^n 

EArp,0p,<Pp)  =  'E  Z  \-AEp('n,n)N2J(r^,e^,(pp) 


n=l  m=-n 


(9) 


+  Am„  (m,n)M  {rp,dp,^p )] 


nPArp,0p,^p)=j'Z'X  [AEp(.fn,n)M2^ir^,0^,^p)  (jq) 
+AMp(jri,n)N2lirpAA)\ 

where  Aep  (m,n),  AMp(m,n)  are  the  unknown  scattered  field 
coefficients.  To  express  the  scattered  fields  from  the 
sphere  in  the  coordinate  system  of  the  p**"  sphere,  we  apply 
the  spherical  vector  translation  addition  theorem  for 
translation  along  the  z-axis  [15],  i.e., 

(11) 

^B::{d^)N2lirpA4p)\ 

^’(';A4)  =  Z  (12) 

+BZ^d^^)M2l{rp,ep,^p)\ 


where  /l^v  )  and  J?^"(^p^)are  the  translation 


coefficients  of  the  spherical  vector  translation  addition 
theorem.  To  determine  the  unknown  scattered  field 
coefficients,  we  apply  the  boundary  conditions  on  the 
various  interfaces,  i.e.. 


Ei(bp,ep,(Pp)+  Z  E\bp,0p,^p) 

P=1 

Hi{bp.ep,(f>p)A  Z  ff\bp,0p.</>p) 

p=\ 


=  rpXE„(bb,ep,^n)  03) 


~  tpX H fi(,bpt0p,^p) 


(14) 


rp^Eji(ap,ep,(fip)  =  rpXEj(ap,ep,4>p)  (15) 

rpXHij{ap,dp,^p)  =  rpXH[iap,ep,^p)  (16) 

Substituting  the  appropriate  field  expansion  expressions  in 
equations  (13)  to  (16),  and  applying  the  orthogonality 
properties  of  spherical  vector  wave  functions  and 
eliminating  the  transmission  coefficients  we  obtain 

N  « 

App(m,n)  =  v„(Pp)Pp(m,n)+'^ll  [AZi‘^P<})AEp('”>'')  (17) 

^=1  v=l 
q^p 

+  BZli.dpq)A^p{m,v)] 


A^p{m,n)=u„{Pp)Qp{m,n)+'EY.  M::(4’9)^«p('«>'^)  (18) 


q=\  v=! 
q*p 


■^EZ{dpq)A^^{m,v)] 


where  v„(/>p)and  w„(/?^)are  the  electric  and  magnetic 

scattered  field  coefficients  for  a  single  dielectric  sphere  coated 
with  a  dielectric  layer  [1,9].  Equations  (17)  and  (18)  may  be 
written  in  matrix  form  for  the  purpose  of  computing  the 
scattered  field  coefficients,  i.e. 

A^L+TA  (19) 


where  A  and  L  are  column  matrices  for  the  unknown 
scattered  and  incident  field  coefficients,  respectively,  and 
7"  is  a  square  matrix  which  contains  the  translation  addition 
coefficients. 

Once  the  scattered  field  is  computed  from  equation  (19),  the 
normalized  bistatic  cross  section  can  be  obtained  as  in  [16]. 
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3.  Numerical  Results 

In  order  to  check  the  validity  of  our  computer  program, 
several  numerical  tests  were  conducted  and  the  results 
compared  favorably  with  previously  published  results  [7- 
8,1 1].  These  tests  included  the  limiting  cases  of  (i)  an  array 

of  dielectric  spheres  obtained  by  setting  kb  —  ka  ,  €jj^  =1  or 

(^0  an  array  of  conducting  spheres  each  coated 

with  a  single  dielectric  layer  obtained  by  setting  =<»and 
(iii)  an  array  of  conducting  spheres  obtained  by  setting 
=  ^  andkb^kaor 

In  this  paper,  we  presented  numerical  results  for  different 
sphere  arrays  to  show  the  dependence  of  the  radar  cross 
section  on  various  parameters  characterizing  the  geometry, 
material  properties,  and  incidence  angles.  Fig.  2  shows  the 
normalized  bistatic  cross  section  versus  the  scattering  angle 
0  for  a  system  of  three  identical  spheres  in  the  E  and  H- 
planes.  The  electrical  radii  of  the  outer  and  inner  spheres  are 
ka=2.0  and  kb=2.5,  respectively,  while  the  electrical 
separation  between  successive  spheres  is  kd=7.0,  and  the 
relative  dielectric  permittivity  of  the  inner  dielectric  layer  is 
3.0  and  the  outer  is  air.  The  purpose  of  this  comparison  is  to 
check  the  accuracy  of  the  computer  code  for  the  dielectric 
sphere  case  [8]  as  a  special  case  of  the  dielectric  spheres 
except  the  relative  dielectric  permittivity  of  the  dielectric 
layer  is  set  equal  to  unity.  The  parameters  of  Fig.  3  are 
similar  to  Fig.  2  except  that  the  dielectric  layer  has  a  value 
of  2.  We  can  see  that  the  number  of  resonances  in  E  plane 
is  increased.  Figs.  4  and  5  have  the  same  parameters  as  in 
Fig.  3  except  that  the  number  of  spheres  is  increased  to  five 
and  eight,  respectively.  We  can  see  that  the  number  of 
resonances  also  increases  with  the  number  of  spheres. 

Fig.  6  shows  the  normalized  backscattering  cross  section 
versus  the  electrical  distance  (kd),  which  ranges  from  8 
(touching)  to  15.5  for  end  fire  incidence  and  the  number  of 
spheres  is  five.  The  electrical  radii  of  outer  and  inner 
spheres  are  ka=4.0  and  kb=3.0,  repectively,  while  the 
relative  dielectric  permittivity  of  the  inner  dielectric  layer  is 
3.0  and  for  the  outer  layer  is  2.  Fig.  7  is  similar  to  Fig.  6 
except  the  number  of  spheres  is  increased  to  8.  We  can  see 
that  the  location  of  the  maximum  peaks  did  not  change  by 
increasing  the  number  of  spheres  for  both  cases. 
Furthermore,  the  magnitude  of  the  normalized 
backscattering  cross  section  at  the  maximum  peaks 
increased  with  increasing  number  of  spheres. 

In  Figs.  8  and  9  we  have  plotted  the  normalized 
backscattering  cross  as  a  function  of  the  angle  of  incidence 
OC ,  which  ranges  from  0  to  90  degrees  for  a  system  of  three 
and  eight  spheres.  The  electrical  radii  of  the  outer  and  inner 
spheres  are  ka=1.5  and  kb=1.0,  repectively,  while  the 
relative  permittivity  of  the  inner  dielectric  layer  is  4  while 


for  the  outer  layer  is  3  and  the  electrical  separation  between 
the  centers  of  the  spheres  is  3.0  (touching). 

4.  CONCLUSIONS 

We  have  obtained  an  analytic  solution  of  the  problem  of 
scattering  by  an  array  of  dielectric  spheres  each  coated  with 
a  dielectric  shell.  The  boundary  conditions  are  satisfied  at 
various  interfaces  with  the  help  of  the  vector  translation 
addition  theorem.  The  system  of  equations  was  written  in 
matrix  form  while  the  scattered  field  coefficients  were 
obtained  by  matrix  inversion.  Numerical  results  were 
presented  for  different  numbers  of  spheres,  angles  of 
incidence,  electrical  separation,  and  relative  dielectric 
constant.  For  the  general  case  of  spheres  orientation,  the 
reader  may  find  more  details  in  [8]. 
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Fig.  2:  Normalized  bistatic  cross  section  patterns  for 
three  identical  dielectric  spheres  each  covered  with 
dielectric  layer  with  ka=2.0,  kb=2.5,  kd=7.0, 

and  In  the  E-plane  {(p^Ttll) 

and  H-plane  (  ^  =  0  ). 


Fig.  3:  Normalized  bistatic  cross  section  patterns  for 
three  identical  dielectric  spheres  each  covered  with 
dielectric  layer  with  ka=2.0,  kb=2.5,  kd-7.0, 

a=0,  =3.0,  and  Sjj^  =2.0. 


Fig.  4:  Normalized  bistatic  cross  section  patterns  for  five 
identical  dielectric  spheres  each  covered  with  a  dielectric 

layer  with  ka=2.0,  kb=2.5,  kd=7.0,  ^=0,£;^=3.0,  and 
S  11^=2. 


Fig.  5:  Normalized  bistatic  cross-section  patterns  for  eight 
identical  dielectric  spheres  each  covered  with  a  dielectric 

layer  with  ka=2.0,  kb=2.5,  kd=7.0,  <2=0,£;,  =3.0,  and 
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Fig.  6:  Normalized  backscattering  cross  section  versus 
electrical  separation  (kd)  for  end-fire  incidence  and  a 
linear  array  of  five  identical  dielectric  spheres  each 
covered  with  a  dielectric  layer  with:  ka=4.0,  kb=3.0, 
a  =  0.0,  =3.0,  and  =2.0. 


Fig.  8:  Normalized  backscattering  cross  section  versus 
aspect  angle  (Xfor  a  linear  array  of  three  identical 
dielectric  spheres  each  covered  with  a  dielectric  layer  with 
ka=1.5,  kb=l  .0,  kd=3.0,  £j^  =4.0,  and  =3.0. 


Fig.  7:  Normalized  backscattering  cross  section  versus 
electrical  separation  (kd)  for  end-fire  incidence  and  a 
linear  array  of  eight  identical  dielectric  spheres  each 
covered  with  a  dielectric  layer  with:  ka=4.0,  kb=3.0, 

a  =  0.0,  £j^  =3.0,  and  £jj^  =2.0. 
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Abstract  Fast  solutions  for  UWB  electromagnetic 
induction  (EMI)  scattering  from  fundamental  object 
shapes  are  of  longstanding  interest  for  sensing  of  metallic 
objects,  e.g.  underground  unexploded  ordnance  (UXO) 
detection  and  discrimination.  Researchers  have  recently 
developed  the  general  formulation  for  an  analytical 
solution  for  EMI  scattering  from  a  spheroid.  The 
specialization  based  on  Small  Penetration  Assumption 
(SPA)  is  designed  to  attack  the  high  frequency  difficulties 
that  challenge  many  numerical  techniques.  This  paper  uses 
the  new  analytical  techniques  to  explore  scattering  from 
spheroids  and  other  objects,  with  excitation  complicated 
by  non-uniform  fields.  To  perform  the  necessary 
decomposition  of  the  transmitted  primary  field  into 
spheroidal  modes,  we  represent  the  transmitter  by  a  set  of 
magnetic  dipoles,  which  dramatically  increases  efficiency. 
The  performance  of  the  SPA  solution  is  evaluated  by 
comparison  with  results  from  other  numerical  techniques 
and  measured  data.  Comparison  with  measured  data  also 
indicates  that  EMI  signals  from  some  complicated  objects 
can  be  approximated  by  those  from  spheroids  with  similar 
proportions,  which  is  promising  for  applications  requiring 
fast  solutions,  such  as  inversion  processing. 

Keywords  spheroid,  EMI,  scattering,  SPA,  GEM -3,  non- 
uniform  field,  spheroidal  modes 

1.  Introduction  and  review  of  spheroid  solutions 

In  terms  of  their  physical  responses,  some  complicated  but 
reasonably  smooth,  elongated  objects  can  be  approximated 
by  a  representative  spheroid,  when  observed  from  some 
distance.  The  idea  of  approximate  spheroid  representation 
has  been  applied  in  fluid  mechanics  [1]  and  magnetic  field 
analysis  [2-4].  In  the  EMI  frequency  range  (~  lO's  of  Hz 
up  to  lOO's  of  kHz)  we  are  particularly  attracted  to  the 
possibility,  because  in  this  band  the  smaller  details  of 
shape  may  not  be  important. 


A  great  deal  of  work  has  been  done  in  electromagnetic 
scattering  by  spheroids  and  analytically  shaped  particles, 
with  both  exact  [5,6]  and  approximate  methods  [7-10]. 
Physical  optics  approximations  have  been  used  to  model 
large  particle  scattering  [11,12].  For  scattering  from  more 
than  a  single  particle,  addition  theorems  were  employed 
[13,14].  The  EMI  problem  we  are  studying  here  is 
different  from  the  previously  treated  problems  in  that  (1) 
the  frequency  is  very  low  so  that  the  field  is  magneto - 
quasistatic  (2)  the  scatterer  is  metallic,  with  conductivity 
much  higher  than  that  of  the  surrounding  media.  Tractable 
analytical  and  numerical  solutions  for  the  general  EMI 
problem  have  not  been  available  until  recently  [15-20]. 
New  analytical  solutions  for  the  secondary  (scattered)  field 
from  prolate  spheroids  were  presented  in  [15,16], 
including  high  frequency  approximations,  with 
specialization  for  the  SPA  readily  extended  to  oblate  cases 
as  well  [17].  For  our  applications  here,  we  only  pursue  the 
prolate  case. 

In  the  magneto-quasistatic  EMI  realm,  only  a  scalar 
potential  !Pis  usually  required  for  the  region  surrounding  a 

metallic  scatterer.  The  transmitted  primary  ( )  and 

received  secondary  fields  {y/^)  can  be  expressed  in  that 
region  as  [16] 

^  m  n-m  p=0 

^  m  M=m  p=0 

where  ('n,^,(t))  are  the  standard  spheroidal  coordinates,  d 
is  the  inter-focal  distance,  and  are  Associated 

Legendre  functions,  7’pm((t))  is  cos(m^)  for  p  =  0  and  is 
sin(w^)  for  p  =  1.  The  coefficients  bpmn  for  the  primary 
field  are  known  (readily  calculated),  and  the  unknown 
Bpmn  iBust  be  solved  for. 
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For  high  induction  numbers  (small  skin  depth)  cases,  one 
can  derive  relations  reminiscent  of  impedance  boundary 
conditions  to  treat  the  effects  of  internal  fields.  Different 
approaches  to  this  are  possible,  the  most  general  involving 
use  of  the  magnetic  field  divergence  equation  and  the 
normal  field  components  and  derivatives,  e.g.  [18,20]  in 
the  numerical  realm.  Alternatively,  for  analytical  solution, 
all  higher  order  terms  are  neglected  in  the  governing 
double  curl  equation  in  [16],  mostly  involving  tangential 
gradients  inside  the  object.  Then  to  derive  an  applicable 
gradient  condition  just  below  the  scatterer's  surface,  the 
tangential  field  components  are  assumed  to  have 
functional  dependencies  as  in  a  1-D  frequency  domain 
solution  with  respect  to  the  normal  component,  i.e.  ~ 
tx\>{ia„kn),  where  k  is  the  equivalent  of  wave  number, 
here  equal  to  ^/coap  for  angular  frequency,  electrical 

conductivity,  and  magnetic  permeabilities  co,  0,  and  p, 
respectively.  In  the  specific  case  of  the  spheroids,  oCn  is 
unity  for  the  azimuthal  component  and  can  be  solved  for 
analytically  for  the  angular  component  With  these 
approximations,  one  can  solve  a  simple  algebraic  system 
for  the  unknown  Bp^n,  corresponding  only  to  an  exterior 
problem  in  the  scalar  potential,  with  no  expressions 
involving  the  problematical  spheroidal  wave  functions  in 
the  object  interior.  The  resulting  spheroid  solutions  for  a 
spatially  uniform  primary  field  indicate  that  the  SPA  may 
produce  accurate  results  over  the  entire  EMI  broadband,  in 
particular  where  magnetic  permeability  fi  is  high,  as  for 
steel,  whether  skin  depth  is  small  or  not.  See  [1 5,16,23]  for 
more  details. 


2.  Fast  Decomposition  of  the  primary  field  into 
spheroidal  modes 

To  solve  the  EMI  scattering  problem  for  a  spheroid  using 
our  algorithm,  the  key  task  is  to  decompose  the  known 

primary  field  into  spheroidal  modes  (i.e.  find  in 

(1)).  This  is  easily  done  for  a  uniform  field,  but  otherwise 
may  require  new  analytical  expressions  or  numerical 
computation.  In  our  approach,  one  multiplies  both  sides  of 

(1)  by  Pni'n)  and  then  integrates,  using 

established  orthogonality  relations  for  the  Legendre 
functions  to  obtain 

Ijt 

f ,  K  1 

^  _ 0 . . .  ^2) 

2n  +  \  "  ”  {n-m)\ 


where  a=l  for  m  >  0,p  =  1  or  m  >  1,/?  =  0  ,  and 
a  =  2  for  m  =  0,/?  =  0.  ^  =  4  is  the  surface  of  the 

spheroid.  At  some  distance  away  from  the  sensor  head, 
the  excitation  field  produced  by  most  EMI  transmitters  can 
be  approximated  using  dipole  sources.  For  a  magnetic 
dipole  with  dipole  moment  m  =  at  location 

Rq  potential  and  magnetic  field  will  be 

[21] 


¥ 

H 


m  (R-R3) 
4;r|R-Rj’ 

1 

4;r|R-R,|’ 


^3(R-R,)(R-R.) 


R-R. 


(3) 


m 


At  this  point  the  potential  y/  can  be  translated  into 
spheroidal  coordinates  and  be  decomposed  according  to 
(2),  through  numerical  integration. 

Lab  measurements  were  obtained  using  the  Geophex 
GEM-3  broadband  EMI  sensor  [22].  The  sensor  head 
contains  two  transmitting  current  loops  with  radii 
approximately  20cm  and  10cm.  The  current  in  the  inner 
loop  is  about  half  of  that  in  the  outer  loop  and  it  flows  in 
the  opposite  direction,  so  that  the  primary  field  at  the 
receiver  in  the  head  center  is  near  zero.  The  primary  field 
near  the  sensor  is  complicated  and  cannot  be  calculated 
analytically,  so  at  any  point  of  interest  H  must  be 
calculated  according  to  the  Biot-Savart  Law  [21].  The 
potential  can  then  be  computed  by  integration  of  the 
magnetic  field  along  an  arbitrary  path.  After  the  values  of 
potential  on  the  spheroid  surface  are  obtained,  the  field  can 
be  decomposed  into  spheroidal  modes  according  to  (2). 


For  a  complicated  such  as  that  in  the  GEM-3  near 
field,  decomposition  requires  numerical  integration  in 
several  steps,  which  are  too  time  consuming  for  our 
inversion  calculations  when  the  potential  is  computed  as 
described  above.  Therefore  we  approximate  the  actual 
source  by  a  superposition  of  some  basic  fictitious  sources 
(i.e.  point  magnetic  charges  or  dipoles),  whose  potential 
and  magnetic  field  can  be  specified  analytically.  In 
keeping  with  the  geometry  of  the  GEM-3  sensor,  the 
sources  are  distributed  with  azimuthal  symmetry.  The 
sensor  produces  fields  like  a  magnetic  dipole  in  the  far 
field,  and  the  magnetic  field  direction  near  the  sensor  is 
mainly  perpendicular  to  its  broad  surfaces.  Therefore  we 
distribute  the  magnetic  dipole  sources  only  on  the  sensor 
surface,  such  that  the  dipoles  only  have  a  component 
perpendicular  to  the  sensor  head.  The  source  distribution 
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contains  Mj  at  R^,  =(0,0,0)  ,  and  at 


R^ij  =  (/7,.cos(0^),/?,sin(6»^.),O)  for  i  =  , 

j  =  1,2,3,--  Nf  ,  where  are  magnitudes  of  point 

dipoles  and  and  Nj  are  the  number  of  rings  and  number 
of  sources  in  each  ring,  respectively  (Figure  1).  The  basic 
idea  is  to  distribute  the  dipoles  symmetrically,  with 
different  numbers  at  different  radii.  The  magnetic  field  at 

position  R  will  be  H  =  ^ ^  Aj  with 


1 

^3(R- 

R.K^. 

R-R. 

3 

R-R. 

2  ^ 

J 

(4) 


and  =  1 ,  subscripts  “s”  and  “z”  refer  to  source  and  z 
component. 


b 

Figure  1.  a)  Set  of  dipole  sources  (X's);  the  two  solid  lines 
are  the  inner  and  outer  current  loop  of  the  GEM-3  sensor, 
b)  Control  surface  (I)  employed  to  determine  dipole 
sources  and  the  testing  surface  (II)  used  for  evaluating  the 
accuracy  of  the  field  from  these  sources. 


the  primary  magnetic  field  determined  by  the  fully  detailed 
representation  of  the  sensor  loops  and  that  obtained  from 
the  set  of  dipole  sources.  Then  the  transmitted  potential  of 
GEM-3  can  be  approximated  by  Figure  2 

r  ^ 


¥ 


pr 


R-R 


sij 


(5) 


and  decomposed  according  to  (2) 

3.  Results  and  discussion 


3.1  Accuracy  of  SPA  results 

The  accuracy  of  spheroidal  SPA  solutions  under  a  uniform 
primary  field  has  been  well  studied  [16,23].  As  we  will 
show  below,  the  field  from  the  GEM-3  sensor  can  be 
accurately  represented  by  a  set  of  point  dipoles.  So  we 
begin  here  by  studying  the  performance  of  SPA  algorithm 
for  objects  under  the  non-uniform  fields  from  dipole 
sources.  Figure  2  shows  results  for  a  spherical  object  and 
for  a  1x4  prolate  spheroid,  where  the  induction  number 

\k\a  =  -yjco^aa . 

sphere,  a=:10cm 


Spheroid,  as10cm,b=40cm 


Figure  2.  Scattered  field  from  the  sphere  and  1x4  spheroid 
under  dipole  excitation.  [I  =  50//^ .  The  solid  lines  are 

results  obtained  using  the  MAS  and  dashed  lines  are  from 
the  SPA. 


For  given  magnetic  field  at  control  points  (on  the  control 
surface  in  Figure  1  b),  a  mean  least  square  was  employed 
to  determine  Mi,  i.e.  by  minimizing  the  difference  between 
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f{H2)  f(Hz) 


Figure  3  Comparison  of  SPA  (markers)  and  MAS  (lines)  values  ( )  for  a  prolate  spheroid  (a=4cm,  b=24cm)  in  a  5x5 
grid  of  sensor  locations.  O’  =  4x10*  =  100 ,  Xq  =  3^0  =  0 ,  Zg  =  —  30c/w,  =0,  6^  —7t  I A . 


Results  are  compared  with  those  from  a  verified  and 
complete  numerical  approach,  namely  the  MAS-TSA 
algorithm  [20]  and  are  displayed  in  terms  of  received 
components  in  phase  with  the  primary  field  (real 
component)  and  those  in  phase  quadrature  with  it 
(imaginary  component).  The  radius  of  the  sphere  is  10cm, 
the  dipole  is  15cm  away  from  its  center,  and  the  scattered 
field  was  determined  at  the  same  position  as  the  dipole 
transmitter.  Similarly,  the  1x4  prolate  spheroid  has  the 
same  diameter  but  greater  length  (a=10cm,  b=40cm),  and 


the  dipole  is  at  position  (0,0,60cm).  Although  the  field 
around  the  target  is  non-uniform,  the  SPA  result  still 
works  very  well  over  a  wide  frequency  range. 

To  test  the  accuracy  of  SPA  results  for  more  general  cases, 
we  consider  a  real  EMI  sensor  (GEM-3  developed  by 
Geophex,  Ltd.);  center  of  the  sensor  is  moved  over  a  5x5 
grid  on  a  surface  above  the  target.  The  total  size  of  the  grid 
is  50cm  by  50cm.  A  global  coordinate  system  (x,  y,  z)  on 
the  grid  has  its  origin  at  the  center  and  z  axis  perpendicular 
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to  the  surface.  The  target  center  is  at  (xq,  yo,  Zo)  =  (0»  Zo) 
in  the  global  system,  with  units  in  centimeters.  The 

orientation  is  described  by  6q  (the  angle  with  z  axis)  and 
(pQ  (angle  with  x  axis  of  the  grid).  At  each  grid  point  the 

data  were  calculated  for  17  frequencies  (distributed  from 
30Hz  to  47,970Hz),  for  both  real  and  imaginary 
components.  An  example  is  shown  in  Figure  3,  with  MAS 
results  as  reference  [20].  Results  indicate  that  the  SPA 
routine  is  generally  reliable  in  the  frequency  range  studied, 
for  non-uniform  source  fields,  at  least  for  these  smooth 
and  regular  target  shapes. 

3.2  Representing  the  GEM-3  field  with  magnetic 
dipoles 

Next  we  test  the  accuracy  of  the  GEM-3  primary  field 
represented  by  point  dipoles.  The  "exact”  magnetic  field 
on  the  control  and  testing  surfaces  was  calculated  from 
direct  integration  around  the  current  loops  [21,22],  and 
field  values  on  the  control  surface  were  employed  to 
determine  point  dipoles  needed  to  represent  the  overall 
source  field.  Then  the  approximate  transmitted  field  was 
compared  on  testing  surface  with  that  calculated  from 
direct  integration.  For  this  example  we  choose  the  control 
surfaces  at  z  =  10  cm  and  x  =  30cm  (they  are  in  fact  only 
lines  because  of  the  symmetry  properties).  The  testing 
surfaces  are  at  z  =  15  cm  and  x  =  45  cm.  As  described 
above,  we  distribute  the  dipole  of  sources  in  several  rings. 
The  first  dipole  is  at  the  center,  then  the  i*  (for  /  >  2 ) 

ring  has  radius  of  p.  =— — x20cm,  where  is  the 

total  number  of  rings  of  sources,  the  number  of  sources  in 
the  i*^  ring  is  2i  .  Figure  4  shows  the  comparison  of 
approximated  field  with  the  "exact"  one  (namely 
calculated  directly  from  current  loops)  on  the  testing 
surfaces.  The  approximated  field  converges  to  the  exact 
one  as  the  number  of  sources  increases.  For  this  case,  6  or 
8  rings  of  dipole  sources  is  sufficient  to  describe  the  field 
outside  of  the  testing  surface. 


3.3  Representing  complicated  objects  with 
spheroids 

For  EMI  sensing,  an  arbitrary  scatterer  can  be 
approximated  by  one  or  a  few  magnetic  dipoles  when 
viewed  in  the  far  field.  However,  this  approximation 
becomes  less  accurate  as  the  sensor  gets  closer,  and  in 
much  UXO  sensing  we  must  operate  in  the  very  near  field. 
Over  most  of  the  near  field,  a  prolate  spheroid  of  finite 
extent  may  be  a  better  representative  of  the  object  than  a 
small  number  of  infinitesimal  dipoles.  To  investigate  the 
applicability  of  spheroidal  representations  for  more 


irregular  objects,  we  consider  the  response  of  a  jagged, 
non-BOR  piece  of  elongated  metal  scrap  (Figure  5).  This 
ordnance  fragment  was  collected  at  a  UXO  cleanup  test 
site.  Note  that  the  target  geometry  is  complicated,  with 
jagged  appurtenances  and  different  profiles  in  different 
rotations  about  its  long  axis. 

Figure  6  shows  the  comparison  between  measured  GEM-3 
data  from  the  piece  of  ordnance  scrap  and  simulation  for  a 
best-fit  prolate  spheroid.  The  distance  between  the  object 
center  and  the  sensor  is  10  cm,  approximately  the  same  as 
the  target  length.  Despite  the  irregularities  and  asymmetry 
of  the  target,  it  produces  an  EMI  signature  similar  to  that 
of  a  prolate  spheroid  with  approximately  the  same  overall 
proportions.  The  object’s  lack  of  rotational  symmetry  has 
some  effect  on  the  signature  in  the  orientation  transverse  to 
the  primary  field,  but  strikingly  little.  These  results 
encourage  further  exploration  of  equivalent  spheroid  use 
for  UXO  discrimination.  The  scheme  would  rest  on  the 
assumption  that  one  can  find  a  sufficiently  equivalent 
spheroid  for  UXO  of  interest  and  apply  this  equivalence  in 
model-based  inversion  algorithms.  For  given  measured 
EMI  data,  one  would  identify  a  spheroid  that  produces 
similar  EMI  signals  by  doing  inversion  or  optimization, 
given  that  we  have  fast  algorithm  for  calculating  the 
forward  EMI  solution  from  spheroids.  We  expect  that  the 
scale  and  proportions  of  the  spheroid  will  reflect  the 
geometrical  information  of  the  measured  object.  One  can 
also  calculate  derivatives  of  the  forward  problem  solution 
analytically,  for  use  in  Jacobians  required  for  inversion 
processing. 

Pursuing  this.  Figure  7  shows  an  example  real  UXO, 
composed  of  a  main  body  (magnetic),  a  copper  band,  and 
fins  and  tail  with  different  steel.  With  the  UXO  beneath 
the  measurement  grid  as  shown  in  Figure  7,  its  EMI 
responses  were  measured  along  X  and  Y  axis  (9  points 
along  each  line)  using  the  GEM-3  sensor  and  a 
representative  spheroid  was  found  by  fitting  the  measured 
data.  The  EMI  data  from  the  representative  spheroid  are 
shown  in  Figure  8.  The  first  9  figures  (the  first  and  second 
lines)  are  data  along  X  axis  (y=z=0),  and  the  second  9 
figures  are  data  along  Y  axis  (x=z=0).  Comparison  with 
the  measurement  data  shows  that  even  for  this  complicated 
composite  object,  over  an  array  of  viewing  angles  by  the 
sensor,  it  was  still  possible  to  find  a  representative 
spheroid  which  produces  similar  EMI  response.  One 
should  note  that  this  is  a  cooperative  case  because  the  steel 
part  is  much  larger  than  other  parts,  and  is  close  to  the 
sensor,  so  that  it  dominates  the  response.  In  other 
orientations  in  which  different  sensor  positions  highlight 
the  tail  of  the  UXO,  we  expect  more  difficulty  in  matching 
it  with  a  single  spheroid.  See  [24]  regarding  other 
treatments  of  this  same  target.  Overall,  the  very  good  fit 
observed  here  may  not  always  be  obtained  for  general 
composites  in  highly  non- symmetric  targets. 
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Hz  along  line  x=0.45m 


Hr  along  line  z=0.15m 


Hx  along  line  x=<l.45nri 


z 


Hx  along  line  z=0.15m 


Figure  4.  Convergence  of  the  approximate  field  from  the  transmitter  to  the  exact  one,  over  the  testing  surfaces. 


Figure  5.  Piece  of  ordnance  scrap  on  which  measurements 
were  performed,  with  length  about  10  cm,  width  in  one 
transverse  direction  3~4  cm  and  about  2cm  in  the  other. 


^l3l  Transverse 


Figure  6.  Comparison  of  the  measured  EMI  signal  from 
the  fragment  in  Figure  5  (lines)  with  that  calculated  for  a 
prolate  spheroid  with  a  -  1 .25  cm,  b  =  5  cm, 

=  2.6  X 1 0^  (S/m),  //  =  36/J,,  (dots).  Different  line  types 

are  for  measurements  from  different  views  (i.e.  up  and 
down  for  the  axial  case,  and  four  90°  rotations  about  the 
axis  for  the  transverse  case). 
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Figure  7.  UXO  28  cm  in  length  and  about  8.3  cm  in 
diameter  at  its  widest  point.  Measurements  are  along  X 
and  Y  axis,  which  is  on  a  surface  22cm  above  the  target. 
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Figure  8.  Scattered  field  (in  ppm)  of  UXO  (markers)  and  its  representative  spheroid  ( a  =  2.2  C/n,  6=1  Acm ) 
along  X  and  Y  axis.  <r  =  4xl0*,  =227,  =3.^cm ,  = -0.93cm  ,Zq  =  -22cm  ,^o=0  9^  =nl2. 
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4.  Concluding  Discussion 

The  results  presented  here  contribute  to  the  computational 
capabilities  necessary  for  inversion  of  EMI  measurements, 
in  applications  such  as  discrimination  of  UXO  from  scrap 
[25-27].  To  approach  that  problem,  we  must  be  able  to 
model  both  targets  of  interest  as  well  as  clutter.  With  great 
economy,  the  formulations  above  represent  the  primary 
field  from  a  real  sensor,  using  distributions  of  magnetic 
dipoles.  This  source  construction  is  particularly  beneficial 
because  one  can  readily  decompose  the  primary  field  from 
each  contributing  source  into  spheroidal  components.  For 
each  of  these,  in  turn,  fast  analytical  solutions  can  be 
obtained,  for  steel  objects.  Comparison  of  EMI 
measurements  and  simulations  suggests  that  even  some 
rather  irregular  scatterers  may  be  represented  well  by 
spheroidal  shapes.  Some  limitations  of  the  SPA  type 
formulations  can  appear  at  low  frequencies,  especially  for 
objects  with  low  permeability  [20,23].  Ultimately,  this 
problem  can  be  overcome  by  using  the  emerging  full 
analytical  spheroid  solution,  as  opposed  to  the  SPA  [28, 
29].  While  a  great  many  targets  of  interest  can  be 
idealized  as  bodies  of  revolution,  future  work  should 
attack  similar  development  of  solutions  for  ellipsoidal 
shapes.  With  three  different  principal  axis  dimensions  and 
correspondingly  different  directional  scattering,  they 
would  provide  the  most  flexible  tool  for  inversion 
calculations. 
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Abstract:  In  order  to  provide  rigorous  dosimetry  for  in 
vitro  studies,  Telstra  Research  Laboratories  has 
developed  a  modified  transverse  electromagnetic  cell 
exposure  system.  The  system  acts  as  a  chamber  for 
experiments  in  which  a  human  cell  culture  exists  as  a 
very  thin  monolayer  adhered  to  the  bottom  of  a  plastic 
culture  flask  under  a  layer  of  several  millimetres  of 
RPMI  nutrient  medium  while  incubated  in  a  controlled 
atmosphere.  A  key  to  the  rigour  of  any  experiment 
seeking  to  investigate  possible  effects  of  electromagnetic 
energy  on  living  systems  is  to  ensure  that  the  exposures 
used  are  accurately  known,  and  to  achieve  this, 
numerical  methods  for  the  challenging  task  of 
characterising  the  SAR  profile  in  the  medium  have  been 
developed.  This  paper  describes  salient  aspects  of  the 
development  and  analysis  of  the  system. 

Keywords:  FDTD  modelling;  RF  exposure;  TEM  cell; 
SAR;  GSM 

1.  Introduction 

To  enable  the  future  investigation  of  possible 
biological  effects  of  mobile  phone  exposure,  this  paper 
considers  an  experimental  setup  where  a  human  cell 
culture  can  be  exposed  to  900  MHz  GSM  type 
radiofrequency  (RF)  signals.  The  exposure  system 
consists  of  a  modified  Crawford  transverse 
electromagnetic  (TEM)  cell  (Crawford  1974),  which  is 
supplied  with  a  simulated  GSM  mobile  phone  type 
signal  at  20  W  peak  power  (900  MHz  RF  signal  pulsed 
at  1/8  duty  cycle  for  2.5  W  average  power).  The  cell 
culture  is  placed  in  the  TEM  cell  in  standard  Falcon'^^ 
25  cm^  plastic  culture  flasks  and  resides  as  a  very  thin 
monolayer  (a  few  microns)  at  the  bottom  of  6  mm  of  the 
nutrient  RPMI- 1640  (Roswell  Park  Memorial  Institute) 
while  incubated  in  a  37  ®C  CO2  atmosphere.  This  setup 
is  similar  to  that  discussed  in  French  and  Blood  2002 
which  investigated  possible  gene  expression  changes  in 
live  human  astrocytes. 

For  this  situation,  the  specific  energy  absorption  rate 
(SAR)  (W/kg)  is  commonly  accepted  to  be  the  most 
appropriate  metric  for  determining  RF  exposure.  SAR 
can  be  determined  at  any  point  in  a  medium  from  the 
£'-field  (V/m)  at  that  point: 


P 


where  cr  is  the  conductivity  (S/m)  and  p  is  the  mass 
density  (kg/m^). 

Many  RF  exposure  systems  rely  on  measurements 
of  average  absorption  within  the  target  material  to 
determine  the  SAR.  In  a  TEM  cell,  the  actual  SAR  at 
any  given  point  in  the  exposed  medium  will  differ 
markedly  from  the  average  value.  The  geometry  used 
here,  where  the  height  (/?)  of  the  medium  coincides  with 
the  direction  of  the  incident  electric  {E)  field,  ensures 
that  there  is  the  essential  uniformity  in  SAR  on  the 
bottom  layer  of  the  RPMI  medium  where  the  cell  culture 
resides.  However,  it  also  means  that  SAR  typically 
varies  quadratically  with  h  within  the  medium  (see 
Burkhardt  et  al.  1996,  Guy  et  al.  1999,  Samaras  et  al. 
2000,  and  Schdnbom  et  al.  2001).  This  makes  it  critical 
that  accurate  computational  techniques  are  used  to 
provide  a  good  estimate  of  SAR  (especially  on  the 
bottom  layer). 

2.  Choice  and  Design  of  Exposure  Chamber 

To  achieve  well  controlled  and  characterised  RF 
exposures  has  been  a  concern  in  many  previous  studies. 
References  Guy  et  al.  1999,  Kuster  and  Schonbom  2000, 
and  Schonbom  et  al.  2000  and  2001,  discuss  exposure 
requirements  and  the  merits  of  various  exposure  systems. 
Such  systems  include  waveguides,  radial  transmission 
lines  (RTL),  TEM  cells.  Gigahertz  TEM  (GTEM)  cells, 
and  free-field  exposure  in  anechoic  chambers. 

Based  on  considerations  of  physical  size, 
experimental  needs,  and  required  field  characteristics,  a 
TEM  cell  was  chosen.  A  TEM  cell  provides  uniform 
incident  plane  wave  exposure  conditions,  useful  for 
simplifying  analysis,  as  long  as  each  culture  flask  is 
placed  in  a  central  position  and  the  absorbing  medium  is 
not  too  thick  (see  Crawford  and  Workman  1979, 
Burkhardt  et  al.  1996,  Popovic  et  al.  1998,  and 
Schonbom  et  al.  2001)  (Figure  1).  Modifications 
included  provision  of  adequate  ventilation  in  the  37  ®C 
CO2  incubator  (Figure  2);  access  ports  and  inserts  to 
allow  placement  of  the  two  flasks  at  the  midpoint  of  the 
top  half  of  the  TEM  cell  (Figure  3);  and  fixtures  to  allow 
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ingress  and  accurate  location  of  fluoroptic  temperature 
probes  (Figure  4).  The  measured  return  loss  of  the  input 
signal  was  greater  than  30  dB,  To  ensure  CO2  diffusion 
and  suitable  power  efficiency  the  height  of  the  RPMI 
was  chosen  to  be  6  mm.  Total  absorbed  power  with  the 
TEM  cell  loaded  with  two  flasks  containing  the  RPMI 
was  less  than  4%  of  forward  power,  indicating  that  the 
TEM  field  conditions  were  maintained  within  the  TEM 
cell. 

3.  Determination  of  SAR 

The  commercially  available  computational  software 
XFDTD  (Remcom  2003)  was  used  to  determine  the  SAR 
profile  in  the  RPMI  (Figure  5).  The  SAR  was  used  to 
report  the  background  energy  level  in  which  the  cell 
culture  was  placed  and  also  to  validate  the  setup  through 
the  resultant  temperature  estimation.  XFDTD  uses  the 
finite-difference  method  (see,  for  example,  Kunz  and 


Figure  1 .  The  electric  and  magnetic  fields  within  a  TEM  cell, 
with  a  plane  wave  type  structure  at  the  midpoint  of  the  top  half 


Luebbers  1993).  A  computational  mesh  of  the  6  mm 
medium  was  constructed  with  cubical  cells  of  length 
0.5  mm.  The  conductivity  of  RPMI  was  set  at  1.8  S/m, 
relative  permittivity  73.2,  and  density  1000kg/m^  The 
incident  field  frequency  was  900  MHz  with  the  £-field 
polarised  in  the  vertical  direction. 

For  the  supplied  plane  wave  signal  (E-field  214  V/m 
peak  amplitude,  equivalent  to  2.5  W  average  power  in 
the  TEM  cell),  the  calculated  peak  SAR  on  the  bottom  of 
the  medium  was  0.20  W/kg.  The  average  SAR  on  this 
level  was  found  to  be  0.18  W/kg.  Figure  5  highlights  the 
uniformity  of  SAR  on  the  bottom  layer,  and  the  non¬ 
uniformity  in  the  6  mm  vertical  direction  (this  profile  is 
consistent  with  published  results  Burkhardt  et  al.  1996, 
Guy  et  al.  1999,  Pickard  et  al.  1999,  Samaras  et  al.  2000, 
and  SchOnbom  et  al.  2001 ). 


Figure  2.  TEM  Cell  placed  in  the  incubator 
(with  ventilation  holes  shown). 


Figure  3.  Two  culture  flasks,  each  filled  with  6  mm  of  RPMI, 
placed  in  the  middle  of  the  top  half  of  the  TEM  cell. 


Figure  4.  The  placement  of  fluoroptic  temperature 
probes.  Three  probes  were  placed  at  distinct  regions 
of  one  flask.  Each  tip  was  situated  in  the  medium, 
0.5  mm  above  the  flask  bottom. 
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Figure  5.  SAR  profile  in  the  RPMI  medium,  top:  Slice  through  the  middle  (note  the  medium  in  the  neck  of  the  flask  on  the 

left),  bottom:  Bottom  layer  where  cell  culture  resides. 


An  uncertainty  analysis  suggested  the  actual  SAR 
values  should  agree  with  these  values  to  within  ±35%. 
The  main  contributing  factors  to  the  possible  error  are 
the  input  power  (accurate  to  within  ±  25%;  has  a  linear 
influence  on  the  calculated  SAR),  height  of  the  medium 
(accurate  to  within  ±  0.5  mm  resulting  in  a  possible 
±  17%  error;  quadratic),  and  the  conductivity  (±  10%; 
approximately  linear).  The  modelling  has  also 
highlighted  the  significant  influence  of  the  meniscus  due 
to  the  ^-field  polarisation  (see  also  Guy  et  al.  1999, 
Schdnbom  et  al.  2001,  and  Schuderer  and  Kuster  2003). 
The  addition  of  the  meniscus,  1  mm  high  and  1  mm  in 
from  the  edge,  as  shown  in  the  top  right  of  Figure  5,  lead 
to  a  5%  increase  in  SAR  on  the  bottom  layer. 

Note  that  SAR  has  only  been  determined  in  the 
nutrient  medium.  The  method  of  determining  SAR 
without  including  the  cell  culture  layer  itself  is  consistent 
with  work  by  other  investigators  in  this  field  (see,  for 
example,  Schonbom  et  al.  2000  and  2001).  To  also 
determine  SAR  in  the  actual  cell  culture  is  a  difficult 
task  since  the  height  of  this  layer  is  just  a  few  microns. 
If  the  cell  culture  (assumed  to  be  human  brain  cells)  is 
included  in  the  model  as  an  infinitely  thin  planar  layer  at 
the  bottom  of  the  medium  (with  conductivity  0.77  S/m 
and  relative  permittivity  45.8  as  for  average  brain 
(Gabriel  1996))  then  the  peak  SAR  at  the  bottom  is 
reduced  to  0.14  W/kg.  If  instead,  the  cell  culture  is 
modelled  by  the  bottom  0.5  mm  layer  of  mesh  cells,  then 
the  peak  is  0.10  W/kg.  The  actual  value  is  likely  to  lie 
somewhere  in  this  range,  so  long  as  the  dielectric  values 
for  the  cell  culture  have  been  appropriately  chosen  in  the 
models.  Better  consideration  of  this  issue  of  the  SAR  at 
the  cell  culture  layer  may  be  possible  as  improved 
modelling  techniques  become  available. 

The  efficacy  of  using  a  plane  wave  incident  field  to 
simplify  the  analysis  was  confirmed  by  an  XFDTD 


model  of  the  rectangular  central  portion  of  the  TEM  cell 
(using  a  ‘TEM  Excitation  Plane’).  The  resultant 
electromagnetic  fields  have  the  required  plane  wave  type 
structure  at  the  position  of  the  flasks. 

4.  Comparison  with,  and  Application  of,  the 
Burkhardt  Formula 


In  Burkhardt  et  al.  1996,  an  analytical  formulation  is 
derived  that  provides  SAR  as  a  ftinction  of  height 
(thickness),  h,  of  the  medium  (which  is  taken  to  be  in  the 
direction  of  the  incident  field  E).  It  is  assumed  that  the 
geometry  can  be  simplified  to  that  of  a  thin  conductive 
sheet.  This  enables  a  “...  separation  between 
‘capacitive*  and  ‘inductive’  coupling  ...  the  capacitively 
induced  part  of  the  £'-fIeld  [Ecap]  is  normal  and  the 
inductively  induced  part  [£’,w]  is  parallel  to  the  surface 
of  the  [sheet].”  The  formula  is 


SAR{z^) 


) 


\  ^0  J 


\ 


+ 


1 


(2) 

where  it  is  assumed  that  the  centre  of  the  bottom  of  the 
sheet  (lying  in  the  x-y  plane)  is  at  the  origin  of  a 
Cartesian  coordinate  system,  and  z'-z-h/2,  co  =  2;zf, / 
is  the  frequency,  ek  =  €r-  ia/(€o  co),  and  3770.. 

To  consider  the  accuracy  of  the  Burkhardt  formula 
with  h,  XFDTD  models  of  finite  sheets  were  constructed 
at  a  series  of  values.  In  each  XFDTD  model,  the  sheet 
was  of  dimensions  100  mm  x  100  mm  and  was  12  mesh 
cells  high,  and  the  material  and  field  properties  were 
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identical  to  those  above.  For  each  method,  the  SAR 
values  at  the  bottom  of  the  sheet  (centrally  placed  in  the 
XFDTD  model)  were  compared.  The  comparison 
indicated  that  there  w  as  close  agreement  for  a  4  mm  high 
sheet  (Burkhardt  0.0648  W/kg,  XFDTD  0.0634  W/kg), 
around  a  10%  difference  at  6  mm  (0.138  W/kg, 
0.152  W/kg),  with  the  accuracy  diminishing  at  12  mm 
(0.535  W/kg,  1.30  W/kg).  The  analysis  also  highlighted 
that  the  desired  uniformity  over  the  bottom  layer  also 
diminishes  significantly  as  h  increases. 

The  formula  thus  provides  a  coarse  validation 
method  for  checking  the  results  from  an  XFDTD 
analysis  of  the  SAR  in  a  flask.  In  particular,  at  6  mm, 
the  Burkhardt  formula  gives  the  value  of  0.138  W/kg 
(the  sheet  is  the  only  possible  analytic  geometry) 
compared  with  the  XFDTD  value  0.20  W/kg  (calculated 
above  for  the  flask). 

However,  the  formula  offers  a  very  prompt  method 
to  calculate  approximate  SAR  values  (an  XFDTD 
calculation  takes  a  few  hours),  which  is  useful  in  initial 
experimental  design.  It  can  also  be  used  to  test  the 
sensitivity  of  input  parameters.  For  example,  the 
formula  indicates  that  SAR  varies  quadratically  with  h 
(closely  approximating  the  profile  of  the  vertical  slice  in 
Figure  5).  Since  the  cells  reside  on  the  bottom  of  the 
medium,  this  emphasises  the  need  to  choose  and 
measure  this  paramete  r  with  great  care. 

5.  SAR  Validation  through  Temperature 
Measurement  and  Modelling 

The  efficacy  of  using  XFDTD  to  determine  the  SAR 
profile  was  also  confirmed  through  comparison  of 
physical  measurements  of  the  resultant  thermal  rise  in 
the  medium  with  numerically  calculated  estimates  based 
on  the  predicted  SAR.  Tliermal  RF  dosimetry  is  based 
upon  the  equation 

fX—  =  IC7^T  +  pSAR  (3) 

8l 

where  c  is  the  specific  heat  capacity,  and  AT  is  the  thermal 
conductivity. 

A  standard  approach  in  the  determination  of  SAR  is 
to  ignore  the  conduction  term  in  equation  2  and 

measure  the  initial  rate  of  temperature  increase 
(see,  for  example,  Rowley  and  Anderson  1999). 
Flowever,  this  approach  is  not  possible  in  this  case  due  to 
the  high  vertical  SAR  gradient,  unless  either  highly 
sensitive  temperature  probes  are  used  or  the  input  power 
is  significantly  increased  (see  Moros  and  Pickard  1999, 
Pickard  et  al.  1999,  and  Samaras  et  al.  2000).  (An 
alternative  is  to  use  equation  1  and  use  an  F-field  probe 
with  a  tip  that  is  small  enough  and  positioned  so  that  it 
does  not  disrupt  the  exposure  (Schdnborn  et  al.  2000).) 


In  the  validation  measurements,  the  input  power  to 
the  TEM  cell  was  set  at  113.5  W.  Fluoroptic  immersion 
temperature  probes  (Luxtron  unit  790,  probe  type  SFF) 
were  chosen,  as  there  are  no  metallic  components  to 
produce  interference  with  the  F-field.  Measurements 
were  performed  when  the  two  flasks,  each  filled  with 
6  mm  high  medium  were  placed  in  the  TEM  cell,  and 
then  again  when  the  two  flasks  each  had  1 2  mm  of 
medium.  The  medium  consisted  of  RPMI  in  a  gelled 
form  (using  Natrosol  250  HR,  7.4%  by  mass)  to  reduce 
the  effect  of  thermal  diffusion  due  to  convection  (no  cell 
culture  was  present).  The  measured  electrical 
conductivity  of  the  gel  (with  added  common  salt,  0.21% 
by  mass,  to  obtain  comparable  conductivity  to  liquid 
RPMI)  was  2.05  S/m  and  the  relative  permittivity  72.6. 
The  measurements  were  performed  in  an  environment 
that  was  not  temperature  controlled.  These  conditions 
were  not  sufficient  to  accurately  determine  local  SAR 
from  alone,  since  the  chosen  temperature  probes 
have  inadequate  sensitivity  at  this  power  level. 

Thermal  modelling  was  undertaken  through  the 
development  of  a  finite-difference  temperature 
modelling  environment  based  on  equation  3  (using  the 
numerical  formulation  in  Wang  and  Fujiwara  1999,  with 
the  XFDTD  mesh),  and  where,  in  addition,  the  thermal 
conditions  at  the  surface  are  modelled  as  a  convective 
boundary: 

=  -h{T-T,)  (4) 

dn 

where  h  is  the  convection  coefficient,  and  Ta  is  the 
ambient  temperature.  For  the  RPMI  gel,  the  specific 
heat  capacity,  c,  was  set  at  4174  J/(kg  °C),  and  the 
thermal  conductivity,  K,  was  set  at  0.60  W/(m  °C)  (as  for 
water).  The  convection  coefficient,  h,  was  set  at 
10.5  W/(m^  °C)  (see  Kritikos  et  al.  1981,  and  Wang  and 
Fujiwara  1999).  The  increase  in  ambient  temperature  in 
the  TEM  cell,  was  accounted  for  (for  empty  flasks, 
measurements  showed  a  0.16  '’C  increase  over  360  s). 

The  input  SAR  profile  is  first  calculated  in  XFDTD. 
As  well  as  the  mesh  described  previously  for  the  6  mm 
high  medium,  a  second  mesh  was  created  for  a  12  mm 
high  medium  (with  cubical  cells  of  length  1 .0  mm).  The 
measured  values  for  the  conductivity  and  relative 
permittivity  of  the  RPMI  gel  were  used.  The 
calculations  found  that,  with  an  input  of  113.5  W,  the 
peak  SAR  on  the  bottom  of  the  6  mm  and  12  mm 
medium  is  9.25  W/kg  and  67.59  W/kg,  respectively. 
The  SAR  profile  was  then  input  to  the  thermal  model. 

Figure  6  presents  a  comparison  of  two  temperature 
measurements  with  the  temperatures  calculated  by  the 
model.  The  agreement  between  the  results  provides 
confidence  in  the  accuracy  of  the  SAR  values.  Other 
measurements  gave  consistent  results.  Due  to  thermal 
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conduction  processes,  the  temperature  rise  profile  at  any 
given  height  in  the  medium  has  a  similar  shape  over  the 
360  s  period  to  that  shown  in  Figure  6  (apart  from  the 
deviation  in  the  first  few  seconds  due  to  the  particular 
SAR  at  that  position,  which  cannot  be  resolved  in  our 
measurements)  (see  also  Pickard  et  al.  1999  and  Samaras 
et  al.  2000).  However,  this  type  of  long  term 
comparison  does  confirm  the  average  SAR  value,  and  by 
using  this  as  a  reference,  also  confirms  the  value  of  SAR 


throughout  the  height  (given  our  confidence  in  the  shape 
of  the  vertical  profile  as  discussed  above). 

The  temperature  program  was  also  used  to  confirm 
that  for  2,5  W  input,  the  exposure  is  under  what  is 
deemed  ‘athermaf  conditions  (the  peak  steady-state 
temperature  rise  is  estimated  to  be  only  0.01 3®C). 


time  (s)  after  RF  on 

Figure  6.  Comparison  of  measured  and  model  temperature  rise  (0.5  mm  off  the  bottom  in  the  centre)  for  1 13.5  W  input 


6.  Conclusions 

In  vitro  studies  present  a  significant  challenge  to 
researchers  in  ensuring  that  the  SAR  of  the  exposed  cell 
culture  is  accurately  characterised.  A  TEM  cell 
exposure  chamber  provides  a  useful  system  for  in  vitro 
studies  due  to  the  uniform  layer  in  the  medium  where  the 
cell  culture  resides.  However,  due  to  the  highly  non- 
uniform  SAR  profile  in  the  vertical  direction  in  the 
medium,  it  is  critical  that  accurate  techniques  are 
employed  to  obtain  an  accurate  estimate  of  the  SAR. 
This  paper  has  shown  how  this  can  be  provided  through 
the  development  of  SAR  and  temperature  computational 
analysis  methods,  both  in  the  estimation  and  validation 
process. 
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